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(57) Abstract 

A novel polypeptide, 
designated neurotrophic fac- 
tor-4 (NT-4), has been ident- 
ified by PCR amplification of 
human genomic DNA. Pro- 
vided herein is nucleic acid 
encoding NT-4 useful in diag- 
nostics and in the recombi- 
nant preparation of NT-4. Al- 
so provided herein are nucleic 
acids encoding naturally oc- 
curring amino acid sequence 
variants of NT-4, designated 
NT-4P, NT-4y, and NT-4A. 
The neurotrophic factors of 
the invention are useful in the 
treatment of nerve cells and in 
diagnostic assays. 
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NOVEL neurotrophic factor 

Field of the Invention 

This application relates to proteins which are involved in the growth, regulation or 
maintenance of nervous tissue. In particular, it relates to a nerve-derived factors having 
5 homology to NGF. 

Background of the Invention 

Nerve growth factor (NGF) is a protein which has prominent effects on developing 
sensory and sympathetic neurons of the peripheral nervous system. NGF acts via specific 
' , cell surface receptors on responsive neurons to support neuronal survival, promote neurite 
10 outgrowth, and enhance neurochemical differentiation. NGF actions are accompanied by 
alterations in neuronal membranes (Connolly et al., 1981, J. Cell. Biol. 90:176; Skaper and 
Varon, 1980, Brain Res. 197 :379), in the state of phosphorylation of neuronal proteins (Yu, 
et a!., 1980, J. Biol. Chem. 255 :10481: Haleqoua and Patrick, 1980, Cell 22:571), and in 
the abundance of certain mRNAs and proteins likely to play a role in neuronal differentiation 
15 and function (Tiercy and Shooter, 1986, J. Cell. Biol. 1 03:2367). 

Forebrain cholinergic neurons also respond to NGF and may require NGF for trophic 
support. (Hefti, 1986, J, Neurosci., 6:2155). Indeed, the distribution and ontogenesis of 
NGF and its receptor in the central nervous system (CNS) suggest that NGF acts as a target- 
derived neurotrophic factor for basal forebrain cholinergic neurons (Korsching, Nov/Dec 1 986, 
20 Trends in Neuro. Sci., pp 570-573). 

While a number of animal homologues to NGF have become known, it was not until 
recently that an apparently distinct nerve growth factor was identified that nonetheless bears 
some homology to NGF (Leibrock et al., 1989, Nature 341 :149). This factor, called brain- 
derived neurotrophic factor IBDNF), now also called NT-2, was purified from pig brain, and 
25 a partial amino acid sequence determined both from the N-terminal end and from fragments 
purified after cleavages. The longest sequence, compiled from several overlapping fragments, 
was used to synthesize two sets of oligonucleotides that were used to prime the amplification 
of a pig genomic template using the polymerase chain reaction (PCR). The nucleotide 
sequence between the two primers was determined and used to synthesize specific primers 
30 for further PCRs on a complementary DNA template obtained by reverse transcription of total 
RNA isolated from the superior colliculus of the pig brain. The nucleotide sequence so 
obtained contained an open reading frame coding for a protein of 252 amino acids, starting 
with the first methionine codon found after four in-frame stop codons. Leibrock, et al. 
speculate that there is no reason to think that BDNF and NGF should be the only members 
35 of a family of neurotrophic proteins having in common structural and functional 
characteristics, and the authors hope that these common structural features could be used 
to aid the discovery of other members. 
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More recently, anpther novel neurotrophic factor closely related to and BDNF 

was discovered, called neuronal facttir' (NF), or neurotrophin-3 (NT-3). (Hohn, et a)., 1990, 
Nature 344:339: Maisonpierre, et a!., 1990, Science 247:1446; Rosenthal, et al., 1990, 
Neuron 4:767. Both BDNF and NT-3 share approximately 50% of their amino acids with 
5 /ttslGF. High levels of mRNA coding for BDNF and' NT-3 occur in the adult rodent brain. 
/?NGF, BDNF, and NT-3 support survival of selected populations of chick sensory neurons, 
suggesting independent roles in the regulation of neuronal survival during development. 

, Neuronal survival and growth is also affected by growth factors for non-neuronal cells, 
including fibroblast growth factor (FGF), epidermal growth factor, and insulin-like growth 
10 factors. (Morrison, et al., 1987, Science 238 :72: Waticke, 1988, J. Neurosci. 8:2618; 
Bhat, 1983, Dev. Brain Res. 11:315}. Basic FGF (bFGF) supports initial survival and 
subsequent fiber outgrowth of dissociated rodent fetal neurons in culture. While neurons 
from many brain regions are affected, the proportion of neurons surviving varies among brain 
regions, suggesting that subpopulations of neurons are responsive to bFGF. (Morrison, et ah, 
15 1986, Proc. Natl. Acad. Sci. 83:7537; Walicke, et al., 1986, Proc. Natl. Acad. Sci. USA 
S3:3012). Since bFGF lacks a signal sequence typical for released proteins, and since bFGF 
levels present in the brain are much larger than those of £NGF and BDNF, it has been 
questioned whether bFGF plays a physiological role as neurotrophic factor and has been 
proposed that bFGF acts as "injury factor* 1 released in events involving cellular destruction. 
20 (Thoenen, et al., 1987, Rev. Physiol. Biochem. Pharmacol. 109 :145). 

Another neurotrophic factor having potential therapeutic use for peripheral nervous 
system disorders, ciliary neurotrophic factor {CNTFJ, has been cloned and expressed. (Lin, 
etal., 1989, Science, 24jg: 1023). CNTF, which was purified from adult rabbit sciatic nerves, 
acts on the peripheral nervous system and appears to be completely unrelated to NGF. 
25 It is an object to identify a fourth neurotrophic factor in the NGF family and to obtain 

nucleic acid encoding such a factor. 

It is another object to synthesize such a new factor in recombinant cell culture. 
It is yet another object to provide variants and modified forms of such a new factor. 
It is an additional object to prepare immunogens for raising antibodies, as well as to 
30 obtain antibodies, capable of binding such a new factor or variant or modified form thereof. 

Another object is to provide diagnostic and therapeutic compositions comprising such 
a new factor or variant or modified forms thereof, and methods of therapeutic treatment. 

Summary of the Invention 
These and other objects of the invention apparent to the ordinary artisan are 
35 accomplished by first providing a nucleic acid sequence comprising at least a portion of the 
coding sequence for a new nerve-derived factor related to NGF, BDNF, and NT-3, hereafter 
termed neurotrophic factor-4 (NT-4). 

In one aspect, the invention provides an isolated nucleic acid encoding NT-4. In 
another aspect, the invention provides a vector comprising this nucleic acid. In a third 
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aspect, the invention supplies a recombinant host cell comprising this nucleic acid. In yet 
another aspect, the invention furnishes a composition comprising NT-4 from an animal 
species, which composition is free of contaminating polypeptides of that animal species. 

The nucleic acid encoding NT-4 is also used in hybridization assays to identify and to 
5 isolate nucleic acids having substantial sequence homology to the nucleic acid encoding NT-4 . 

NT-4 or fragments thereof (which also may be synthesized by in vitro methods) are 
fused (by recombinant expression or in vitro covalent methods) to an immunogenic 
polypeptide and this, in 1 turn, is used to immunize an animal in order to raise antibodies 
' , against an NT-4 epitope. ,Anti-NT-4 is recovered from the serum of immunized animals. 
10 Alternatively, monoclonal antibodies are prepared from cells of the immunized animal in 
conventional fashion. Antibodies identified by routine screening will bind to NT-4 but will not 
substantially cross-react with NGF, BDNF, or NT-3. Immobilized anti-NT-4 antibodies are 
useful particularly in the diagnosis (in vitro or in vivo ) or purification of NT-4. 

Substitutional, deletional, or insertional mutants of NT-4 are prepared by in vitro or 
15 recombinant methods and screened for immuno-crossreactivity with NT-4 and for NT-4 
antagonist or agonist activity. 

NT-4 also is derivatized in vitro in order to prepare immobilized NT-4 and labelled NT-4, 
particularly for purposes of diagnosis of NT-4 or its antibodies, or for affinity purification of 
NT-4 antibodies. 

20 NT-4, or a variant or modified form thereof, or anti-NT-4 antibody is formulated into 

physiologically acceptable vehicles, especially for therapeutic use. Such vehicles include 
sustained-release formulations. 

In another aspect, the invention provides a method for producing NT-4, or a variant or 
modified form thereof, comprising culturing a transformed host cell and recovering the desired 
25 polypeptide from the host cell culture. 

NT-4 has been found to have a broad tissue distribution and is structurally related to 
NGF, BDNF, and NT-3. Its presence in the brain and muscle tissue indicates that it may be 
useful as a therapeutic agent for neurodegenerative diseases and damaged nerve cells, e.g., 
nerves damaged as a result of trauma. 
30 Therefore, in another aspect, the invention provides a method for treating a 

neurodegenerative disease or damaged nerve cells comprising administering to a mammal an 
effective amount of NT-4, or a variant or modified form thereof. 

Brief Description of the Drawings 
Figure 1 shows the partial nucleotide sequence for the human NT-4 gene (SEQ ID NO. 
35 1 ) and the deduced amino acid sequence (SEQ ID NO. 2), including the entire nucleotide and 
amino acid sequences for mature human NT-4. The arrow indicates where the mature 
sequence begins, the asterisk indicates where the sequence begins for calculating homology 
with other members of the neurotrophic factor family, and the stop codon is circled. The 
amino acids are numbered from the N-terminus of the mature region. 
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Figure 2 shows the homologies among the amino acid sequences of human NT- 2 (SEQ 
ID NO. 3), NT-3 (SEQ ID NO. 4), and NGF (SEQ ID NO. 5), and the mature and partial 
precursor portion of NT-4 (SEQ ID NO. 6). The locations of the sense (NGX-54) and 
antisense (AR1) primer sites on the sequence are marked with vertical solid arrows, and the 
5 start of the mature region is indicated with an arrow. 

Figure 3 shows the nucleotide sequence of a cDNA encoding a portion of human NT-4/? 
(SEQ ID NO. 7), and the deduced amino acid sequence of this portion of NT-40 (SEQ ID NO. 
8). i 

Figure' 4 shows the nucleotide sequence of a genomic DNA encoding human NT-4}/ 
10 {SEQ ID NO. 9), and the deduced amino acid sequence (SEQ ID NO. 10). The first in-frame 
Met residue is located at nucleotide positions 356-358, and is the putative start codon of 
human NT-4)/. 

Figure 5 shows the nucleotide sequence of a genomic DNA encoding human NT-4A 
{SEQ ID NO. 11), and the deduced amino acid sequence of this portion'of 
15 NT-4 A {SEQ ID NO. 12). 

Figure 6 shows the homologies among the amino acid sequences of human NT-4, NT- 
4/?, NT-4k, and NT-4 A. The arrow indicates where the sequence of mature human NT-4 
begins. 

Detailed Description of the Preferred Embodiments 

20 As used herein, "NT-4 B refers to a polypeptide having the amino acid sequence shown 

in Figure 1 for mature human NT-4, amino acid sequence variants of such polypeptide, 
peptide fragments of mature human NT-4 and said amino acid sequence variants, which 
peptides are at least about 5 amino acids in length and comprise an immune epitope or other 
biologically active site of the corresponding polypeptide, and modified forms of mature human 

25 NT-4 and said amino acid sequence variants and peptide fragments wherein the polypeptide 
or peptide has been covalently modified by substitution with a moiety other than a naturally 
occurring amino acid; provided, however, that the particular amino acid sequence variant, 
peptide fragment, or modified form thereof under consideration is novel and unobvious over 
the prior art, and is not NGF, BDNF, or NT-3 of any animal species or any fragment or 

30 modified form of such NGF, BDNF, or NT-3. 

NT-4 nucleic acid is RNA or DNA which encodes a NT-4 polypeptide or which 
hybridizes to such DNA and remains stably bound to it under stringent conditions and is 
greater than about 1 0 bases in length; provided, however, that such hybridizing nucleic acid 
is novel and unobvious over any prior art nucleic acid including that which encodes or is 

35 complementary to nucleic acid encoding NGF, BDNF, or NT-3. Stringent conditions are those 
which (1) employ low ionic strength and high temperature for washing, for example, 0.15 M 
NaCi / 0.015 M sodium citrate / 0.1 % NaDodS0 4 at 50°C, or (2) use during hybridization a 
denaturing agent such as formamide, for example, 50% (vol/vol) formamide with 0.1% 
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bovine serum albumin/0 .'1 % FicoH / 0.1% polyvinylpyrrolidone / 50rriM sodium phosphate 

buffer at pH 6.5 with 750 mM NaCI, 75 mM sodium, citrate at 42°C. 

DNA encoding NT-4 is obtained from brain tissue cDNA libraries, or genomic DNA, or 

by in vitro synthesis. Hybridizing nucleic acid generally is obtained by in vitro synthesis. 

5 Identification of NT-4 DNA most conveniently js accomplished by probing human cDNA or 

i i 'i 

genomic libraries by labeled oligonucleotide sequences selected from the Figure 1 sequence 

in accord With known criteria, among which is that the sequence should be of sufficient 

length and sufficiently unambiguous that false positives are minimized. Typically, a 32 P- 

labeled oligonucleotide having about 30 to 50 bases is sufficient, particularly if the 

10 oligonucleotide contains one or more codons for methionine or tryptophan. Isolated nucleic 
acid will be DNA that is identified and separated from contaminant nucleic acid encoding 
other polypeptides from the source of nucleic acid. The nucleic acid may be labeled for 
diagnostic purposes. > 

Amino acid sequence variants of NT-4 are polypeptides having an amino acid sequence 

15 which differs from that shown in Figure 1 for mature human NT-4 by virtue of the insertion, 
deletion, and/or substitution of one or more amino acid residues within the Figure 1 sequence. 
Amino acid sequence variants generally will be about 75% homologous (and often greater 
than 85% homologous) to mature human NT-4 based on a comparison of the amino acids 
present at each position within the sequences, after aligning the sequences to provide for 

20 maximum homology. 

Amino acid sequence variants of NT-4 may be naturally occurring or may be prepared 
synthetically, such as by introducing appropriate nucleotide changes into a previously isolated 
NT-4 DNA, or by in vitro synthesis of the desired variant polypeptide. As indicated above, 
such variants will comprise deletions from, or insertions or substitutions of, one or more 

25 amino acid residues within the amino acid sequence shown for mature human NT-4 in Figure 
1 . Any combination of deletion, insertion, and substitution is made to arrive at an amino acid 
sequence variant of NT-4, provided that the resulting variant polypeptide possesses a desired 
characteristic. The amino acid changes also may result in further modifications of NT-4 upon 
expression in recombinant hosts, e.g. introducing or moving sites of glycosylation, or 
30 introducing membrane anchor sequences (in accordance with PCT WO 89/01041 published 
Feb. 9, 1989). 

Preferably, an amino acid sequence variant of NT-4 that is naturally occurring, 
including, for example, a naturally occurring allele, will be produced by recombinant means 
by expressing in a suitable host cell genomic DNA or cDNA comprising the nucleotide coding 
35 sequence for such naturally occurring variant. Other amino acid sequence variants of NT-4 
will be produced by making predetermined mutations in a previously isolated NT-4 DNA. 
There are two principal variables to consider in making such predetermined mutations: the 
location of the mutation site and the nature of the mutation. In general, the location and 
nature of the mutation chosen will depend upon the NT-4 characteristic to be modified. For 



BNSDOCID: <WO 9205254A1J, ; 



WO 92/05254 



PCT/US91/06950 > 



■6- 

example, candidate NT-4 antagonists or super agonists' initially will be selected by locating 
amino acid residues that are identical or highly conserved among NGF, BDNF, NT-3, and NT- 
4. Those residues then will be modified in series, e.g., by (1) substituting first with * 
conservative choices and then with more radical selections depending upon the results 
5 achieved, (2) deleting the target residue, or (3) inserting residues of the same or different * 
class adjacent to the located site, or combinations of options 1-3. 

One helpful technique is called "ala scanning". Here, an amino acid residue or group 
of target residues are identified and substituted by alanine or polyalanine. Those domains 
, demonstrating functional sensitivity to the alanine substitutions then are refined by 

10 introducing further or other variants at or for the sites of alanine substitution. 

Obviously, such variations which, for example, convert NT-4 into NGF, BDNF, or NT-3 
are not included within the scope of this invention, nor are any other NT-4 variants or 
polypeptide sequences that are not novel and unobvious over the prior art. Thus, while the 
site for introducing an amino acid sequence variation is predetermined, the nature of the 
15 mutation p_er se need not be predetermined. For example, to optimize the performance of a 
mutation at a given site, ala scanning or random mutagenesis is conducted at the target 
codon or region and the expressed NT-4 variants are screened for the optimal combination 
of desired activity. 

Amino acid sequence deletions generally range from about 1 to 30 residues, more 

20 preferably about 1 to 10 residues, and typically are contiguous. Deletions may be introduced 
into regions of low homology among BDNF, NGF, NT-3, and NT-4 to modify the activity of 
NT-4. Deletions from NT-4 in areas of substantial homology with BDNF, NT-3, and NGF will 
be more likely to modify the biological activity of NT-4 more significantly. The number of 
consecutive deletions will be selected so as to preserve the tertiary structure of NT-4 in the 

25 affected domain, e.g., beta-pleated sheet or alpha helix. 

Amino acid sequence insertions include amino- and/or carboxyl-terminal fusions ranging 
in length from one residue to polypeptides containing a thousand or more residues, as well 
as intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions 
(i.e., insertions within the mature NT-4 sequence) may range generally from about 1 to 10 

30 residues, more preferably 1 to 5, most preferably 1 to 3. An example of a terminal insertion 
includes fusion of a heterologous N-terminai signal sequence to the N-terminus of the NT-4 
molecule to facilitate the secretion of mature NT-4 from recombinant hosts. Such signals 
generally will be homologous to the intended host cell and include STII or lpp for E. coli , alpha 
factor for yeast, and viral signals such as herpes gD for mammalian cells. Other insertions 

35 include the fusion of an immunogenic polypeptide such as a bacterial or yeast protein to the 
N- or C-termini of NT-4. 

The third group of variants are those in which at least one amino acid residue in NT-4, 
and preferably only one, has been removed and a different residue inserted in its place. An 
example is the replacement of arginine and lysine by other amino acids to render the NT-4 

BNSDOCID: <WO ..9205254A1 J_> 



WO 92/05254 



PCT/US91/06950 



-7- 

resistent to proteolysis by serine proteases, thereby creating a variant of NT-4 that is more 
stable. The sites of greatest interest for substitutional mutagenesis include sites where the 
amino acids found in BDNF, NGF, NT-3, and NT-4 are substantially different in terms of side 
chain bulk, charge or hydrophobicity, but where there also is a high degree of homology at 
5 the selected site within various animal analogues of NGF, NT-3, and BDNF (e.g., among all 
the animal NGFs, all the animal NT-3s, and all the BDNFs). This analysis will highlight 
residues that may be involved in the differentiation of activity of the trophic factors, and 
therefore, variants at these sites may affect such activities. Examples of such sites in mature 
human NT-4, numbered from the N-terminal end, and exemplary substitutions include NT-4 

10 (G78~>K, H, Q or R) (SEQ ID NOS. 13, 14, 15, and 16, respectively) and NT-4 (R85~>E, 
F, P, Y or W) (SEQ ID NOS. 17, 18, 19, 20, and 21 , respectively). Other sites of interest are 
those in which the residues are identical among all animal species' BDNF, NGF, NT-3, and NT- 
4, this degree of conformation suggesting importance in achieving biological activity common 
to all four factors. These sites, especially those falling within a sequence of at least 3 other 

1 5 identically conserved sites, are substituted in a relatively conservative manner. Such 
conservative substitutions are shown in Table 1 under the heading of preferred substitutions. 
If such substitutions result in a change in biological activity, then more substantial changes, 
denominated exemplary substitutions in Table 1 , or as further described below in reference 
to amino acid classes, are introduced and the products screened. 

20 Table 1 



Original Residue 


Exemplary 


Preferred 




Substitutions 


Substitutions 


Ala (A) 


val; leu; He 


val 


Arg (R) 


lys; gin; asn 


lys 


Asn (N) 


gin; his; lys; arg 


gin 


Asp (D) 


glu 


glu 


Cys (C) 


ser 


ser 


Gin (Q) 


asn 


asn 


Glu (E) 


asp 


asp 


Gty (G) 


pro 


pro 


His (H) 


asn; gin; lys; arg; 


arg 


He (I) 


leu; val; met; ala; phe; 
norleucine 


leu 


Leu (L) 


norleucine; iie; val; 
met; a!a; phe 


ile 


Lys (K) 


arg; gin; asn 


arg 


Met (M) 


leu; phe; ile 


leu 


Phe (F) 


leu; val; ile; ala 


leu 


Pro (P) 


sly 


gly 
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Ser (S) ' thr thr . ' 

Thr (T) ser ser 

Trp (W) tyr tyr 

Tyr (Y) trp; phe; thr; ser phe 

5 Val (V) ile; leu; met; phe; 

ala; norleucine . leu •■ 

Sites particularly suited for conservative substitutions include, numbered from the N- 

terminus of the mature human NT-4, R1 1, G12, E13, VI 6, D18, W23, V24, D26, V40, L41, 

Q54, Y55, F56/E58, T59/G77, R79, G80, H85, W86, A99, L100, T101, WHO, R111, 

1 0 W1 12, 1113, R1 1 4, 1115, D1 1 6, and A1 1 8. Cysteine residues not involved in maintaining 

the proper conformation of NT-4 also may be substituted, generally with serine, in order to 

improve the oxidative stability of the molecule and prevent aberrant crosslinking. Sites other 

than those set forth in this paragraph are suitable for deletional or insertional studies generally 

i ■ 

described above. t ( 

15 Substantial modifications in function or immunological identity are accomplished by 

selecting substitutions that differ significantly in their effect on maintaining (a} the structure 
of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the 
bulk of the side chain. Naturally occurring residues are divided into groups based on common 
20 side chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

{4) basic: asn, gin, his, lys, arg; 
25 (5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes 

for another. Such substituted residues also may be introduced into the conservative 

substitution sites set forth above or, more preferably, into the remaining (non-conserved) 
30 sites. 

Examples of NT-4 variants include NT-4(E67-->S or T) (SEQ ID NOS. 22 and 23, 
respectively) (this adds an N-linked glycosylation site); NT-4(R83-Q94) (SEQ ID NO. 24); NT- 
4(G1-C61) (SEQ ID NO. 25) (variants so depicted are fragments containing the residues 
indicated); NT-4(G 1 -CI 7) (SEQ ID NO. 26); NT-4(C1 7-C61 ) (SEQ ID NO. 27); NT-4(C1 7-C78) 
35 {SEQ ID NO. 28); NT-4(C17-C90) (SEQ ID NO. 29); NT-4(C17-C1 1 9) (SEQ ID NO. 30); NT- 
4(C1 7-C1 21 ) (SEQ ID NO. 31 ); NT-4(R1 1-R27) (SEQ ID NO. 32); NT-4(R1 1 -R34) (SEQ ID NO. 
33); NT-4(R34-R53) (SEQ ID NO. 34); NT-4<C61-C78) (SEQ ID NO. 35); NT-4(R53-C61 ) (SEQ 
ID NO. 36); NT-4(C61-C1 19) (SEQ ID NO. 37); NT-4(C61-C78) (SEQ ID NO. 38); NT-4(C78- 
C1 19) (SEQ ID NO. 39); NT-4(C61-C90) (SEQ ID NO. 40); NT-4(R60-C78) (SEQ ID NO. 41); 
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NT-4(K62-Cil9)'(SEQ lb NO. 42); NT-4(K62-K9U (SEQ ID NO. 43);'NT-4(R79-R98) (SEQ 
ID NO. 44); NT-4(R83-K93) (SEQ ID NO. 45); NT-4<T101-R1 11) (SEQ ID NO. 46); NT-4(G1- 
C121) VLTVKRVRR (SEQ ID NO. 47); NT~4(V40-C121 ) VLTVKRVRR (SEQ ID NO, 
48); NT-4(V40-C121) S L T I K R I R A (SEQ ID NO. 49); NT-4(V40-C1 21 ) TLSRKAGR 
5 R A (SEQ ID NO. 50); DDDSPIARRG EI,S VCDSVSDWVSAPDKDTAVD 
IKGDDVMVLKK V*G INHSV NT-4(V40-C1 21 ) (SEQ ID NO. 51); hNGF{S1-V48) NT- 
4(V40-C12'1) hNGF(V109-A!20) (SEQ ID NO. 52); NT-4(AC78) (SEQ ID NO. 53); NT- 
4(AC61) (SEQ ID NO. 54); NT-4(AQ54-AT59) (SEQ ID NO. 55) (variants depicted in this 
fashion comprise deletions of the indicated span of residues, inclusive); NT-4(AR60-AD82) 
1 0 (SEQ ID NO. 56); NT-4(AH85-AS88) (SEQ ID NO. 57); NT-4(AW86-AT1 01 ) (SEO ID.NO. 58); 
NT-4(R53-->H) (SEQ ID NO. 59); NT-4(K91~>H) (SEQ ID NO. 60); NT-4(V108-->F) (SEQ 
ID NO. 61); NT-4(R84-->Q, H, N, T, Y'or W) (SEQ ID NOS. 62, 63, 64, 65, 66, and 67, 
respectively); and NT-4(D1 16— >E, N, Q, Y, S or T) (SEQ ID NOS. 68, 69, 70, 71, 72, and 
73, respectively). 

1 5 Also included is NT-4 wherein position 70 is substituted with an amino acid residue 

other than G, E, D or P; position 71 with other than A, P or M; and/or position 83 with other 
than R, D, S or K; as well as cyclized NT-4 fragments, including cyclic polypeptides 
comprising the sequences IKTG (SEQ ID NO. 74), EIKTG (SEQ ID NO. 75), EIKTGN (SEQ ID 
NO. 76), SPV, SPVK (SEQ ID NO. 77), HQV, KSS, KSSA (SEQ ID NO. 78), YAEHKS <SEQ ID 

20 NO. 79), RYAEHKS (SEQ ID NO. 80), RYAEHKSH (SEQ ID NO. 81), YAEHKSH (SEQ ID NO. 
82), ANRTS (SEQ ID NO. 83), NRT, ANRT (SEQ ID NO. 84), NRTS (SEQ ID NO. 85), KEA, 
KEAR (SEQ ID NO. 86), KEARP (SEQ ID NO. 87), IDDK (SEQ ID NO. 88), SENN (SEQ ID NO. 
89), TSENN (SEQ ID NO. 90), TSENNK (SEQ ID NO. 91) or KLVG (SEQ ID NO. 92). 

Also within the scope hereof are BDNF, NT-3, and NGF amino acid sequence variants 

25 having analogous structures to the NT-4 variants set forth herein. For example, the 
ana)ogous positions of NGF, NT-3, and BDNF are substituted with a residue other than D, E, 
or P, respectively, in analogy to the same mutation at position 70 of NT-4. 

DNA encoding amino acid sequence variants of NT-4 may be isolated from a natural 
source (in the case of naturally occurring amino acid sequence variants) or may be prepared 

30 by site-specific mutagenesis of DNA that encodes an earlier prepared variant or a nonvariant 
version of NT-4. Site-specific mutagenesis allows the production of NT-4 variants through 
the use of specific oligonucleotide sequences that encode the DNA sequence of the desired 
mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer 
sequence of sufficient size and sequence complexity to form a stable duplex on both sides 

35 of the deletion junction being traversed. Typically, a primer of about 20 to 25 nucleotides 
in length is preferred, with about 5 to 10 residues on both sides of the junction of the 
sequence being altered. In general, the technique of site-specific mutagenesis is well known 
in the art, as exemplified by publications such as Adelman, et al., 1983, DNA 2:1 83. 
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As will be appreciated, the site-specific mutagenesis technique typically employs a 
phage vector that exists in both a single-stranded and double-stranded form. Typical vectors 
useful in site-directed mutagenesis include vectors such as the Ml 3 phage, for example, as 
disclosed by Messing, et al., 1981, Third Cleveland Symposium on Macromolecules and 
5 Recombinant DNA , (A. Walton, Ed., Elsevier, Amsterdam). These phage are readily 
commercially available and their use is generally well known to those skilled in the art. Also, 
** plasmid vectors that contain a single-stranded phage origin of replication (Veira, et al., 1 987, 

Meth. Enzymol, 153 :3) may be employed to obtain single-stranded DNA. Alternatively, 
■ nucleotide substitutions are introduced by synthesizing the appropriate DNA fragment in vitro 

10 and amplifying it by polymerase chain reaction (PCR) procedures known fierce in the art. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector that includes within its sequence a DNA sequence that 
encodes the relevant protein. An oligonucleotide primer bearing the desired mutated 
sequence is prepared, generally synthetically, for example, by the method of Crea, et al., 
15 1 978, Proc. Natl. Acad. Sci. 25:5765). This primer is then annealed with the single-stranded 
ii protein-sequence-containing vector, and subjected to DNA-polymerizing enzymes such as E. 

coli polymerase I Klenow fragment, to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is then 
20 used to transform appropriate cells such as JM101 cells and clones are selected that include 
3 recombinant vectors bearing the mutated sequence arrangement. 

After such a clone is selected, the mutated region may be removed and placed in an 
appropriate vector for protein production, generally an expression vector of the type that is 
typically employed for transformation of an appropriate host. 
25 Most deletions and insertions, and substitutions in particular, of amino acids in NT-4 

are not expected to produce radical changes in its characteristics, and single substitutions will 
preserve at least one immune epitope in the NT-4 polypeptide. 

Since it is often difficult to predict in advance the characteristics of a variant NT-4, it 
will be appreciated that some screening will be needed to identify a variant having a desired 
30 characteristic. One can screen for enhanced trophic activity, differential neuron cell type 
specificity, stability in recombinant cell culture or in plasma (e.g. against proteolytic 
cleavage), possession of antagonist activity, oxidative stability, ability to be secreted in 
elevated yields, and the like. For example, a change in the immunological character of the 
NT-4 polypeptide, such as affinity for a given antibody, is measured by a competitive-type 
35 immunoassay. Changes in the enhancement or suppression of neurotrophic activities by the 
candidate mutants are measured by dendrite outgrowth or explant cell survival assays. 
Modifications of such protein properties as redox or thermal stability, hydrophobicity, 
susceptibility to proteolytic degradation, or the tendency to aggregate with carriers or into 
multimers are assayed by methods well known in the art. 
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Trypsin or other protease cleavage sites are identified by inspection of the encoded 
amino acid sequence for paired basic amino acid residues, e.g. combinations of adjacent 
arginyi and lysinyl residues. These are rendered inactive to protease by substituting one of 
the residues with another residue, preferably a basic residue such as glutamine or a 
5 hydrophobic residue such as serine; by deleting one or both of the basic residues; by inserting 
a prolyl residue immediately after the last basic residue; or by inserting another Residue 
between the two basic residues. 

' An amino acid sequence variant of NT-4 typically is produced by recombinant means, 
that is, by expression of nucleic acid encoding the variant NT-4 in recombinant cell culture, 

10 and, optionally, purification of the variant polypeptide from the cell culture, for example, by 
bioassay of the variant's activity or by adsorption on an immunoaffinity column comprising 
rabbit anti-NT-4 polyclonal antibodies (which will bind to at least one immune epitope of the 
variant which is also present in native NT-4). Small peptide fragments, on the order of 40 
residues pr less, are conveniently made by in vitro methods. 

1 5 Once DNA encoding NT-4 is obtained, typically it is then ligated into a replicable vector 

for further cloning or for expression. Vectors are useful for performing two functions in 
collaboration with compatible host cells {a host-vector system). One function is to facilitate 
the cloning of the DNA that encodes the NT-4, i.e., to produce usable quantities of the 
nucleic acid. The other function is to direct the expression of NT-4. One or both of these 

20 functions are performed by the vector-host system. The vectors will contain different 
components depending upon the function they are to perform as well as the host cell that is 
selected for cloning or expression. 

Each vector will contain DNA that encodes NT-4 as described above. Typically, this 
will be DNA that encodes the NT-4 in its mature form linked at its amino terminus to a 

25 secretion signal. This secretion signal preferably is the NT-4 presequence that normally 
directs the secretion of NT-4 from human cells jn vivo . However, suitable secretion signals 
also include signals from other animal NT-4, signals from NGF, NT-2, or NT-3, viral signals, 
or signals from secreted polypeptides of the same or related species. 

If the signal sequence is from another neurotrophic polypeptide, it may be the precursor 

30 sequence shown in Figure 2 which extends from the initiating methionine (M) residue of NT-2, 
NT-3, or NGF up to the arginine (R) residue just before the first amino acid of the mature 
protein, or a consensus or combination sequence from any two or more of those precursors 
taking into account homologous regions of the precursors. The DNA for such precursor 
region is ligated in reading frame to DNA encoding the mature NT-4. 

35 Expression and cloning vectors contain a nucleotide sequence that enables the vector 

to replicate in one or more selected host cells. Generally, in cloning vectors this sequence 
is one that enables the vector to replicate independently of the host chromosomes, and 
includes origins of replication or autonomously replicating sequences. Such sequences are 
well-known for a variety of bacteria, yeast and viruses. The origin of replication from the 
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well-known plasmid pBR322 is suitable for most gram negative bacteria, the 2p plasmid origin 
'for yeast and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 
cloning vectors in mammalian cells. Origins are not needed for mammalian expression vectors 
(the SV40 origin may typically be used only because it contains the early promoter). Most 
5 expression vectors are "shuttle" vectors, i.e. they are capable of replication in at least one 
class of organisms but can be transfected into another organism for expression. For example, 
a vector is cloned in E. coji and then the same vector is transfected into yeast or mammalian 
cells for expression even though it is not capable of replicating independently of the host cell 
chromosome. 1 

10 DNA also is cloned by insertion into the host genome. This is readily accomplished 

with bacillus species, for example, by including in the vector a DNA sequence that is 
complementary to a sequence found in bacillus genomic DNA. Transfection of bacillus with 
this vector results in homologous recombination with the genome and insertion of NT-4 DNA. 
However, the recovery of genomic DNA encoding NT-4 is more complex than that of an 

1 5 exogenously replicated vector because restriction enzyme digestion is required to excise the 
NT-4 DNA. 

Expression and cloning vectors should contain a selection gene, also termed a 
selectable marker. Typically, this is a gene that encodes a protein necessary for the survival 
or growth of a host cell transformed with the vector. The presence of this gene ensures that 

20 any host cell which deletes the vector will not obtain an advantage in growth or reproduction 
over transformed hosts. Typical selection genes encode proteins that (a) confer resistance 
to antibiotics or other toxins, e.g. ampicillin, neomycin, methotrexate or tetracycline, (b) 
complement auxotrophic deficiencies, or (c) supply critical nutrients not available from 
complex media, e.g. the gene encoding D-aianine racemase for bacilli. 

25 A suitable selection gene for use in yeast is the trp.1 gene present in the yeast plasmid 

YRp7 (Stinchcomb, et a!., 1979, Nature 282 :39; Kingsman, et al., 1979, Gene 7:141; 
Tschemper, et al., 1980, Gene JO: 157). The trp.1 gene provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 
44076 or PEP4-1 (Jones, 1977, Genetics 85:12). The presence of the trpl lesion in the 

30 yeast host cell genome then provides an effective environment for detecting transformation 
by growth in the absence of tryptophan. Similarly, Leu2 deficient yeast strains (ATCC 
20,622 or 38,626} are complemented by known plasmids bearing the Leu 2 gene. 

Examples of suitable selectable markers for mammalian cells are dihydrofolate 
reductase (DHFR) or thymidine kinase. Such markers enable the identification of cells which 

35 were competent to take up the NT-4 nucleic acid. The mammalian cell transformants are 
placed under selection pressure which only the transformants are uniquely adapted to survive 
by virtue of having taken up the marker. Selection pressure is imposed by culturing the 
transformants under conditions in which the concentration of selection agent in the medium 
is successively changed, thereby leading to amplification of both the selection gene and the 
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DNA that encodes NT-4. Amplification is the process by which genes in greater demand for 
the production of a protein critical for growth are reiterated in tandem within the 
chromosomes of successive generations of recombinant cells. Increased quantities of NT-4 
are synthesized from the amplified DNA. 
5 For example, cells transformed with the DHFR selection gene are first identified by 

culturing all of the transformants in a culture'medium which contains methotrexate (Mtx), a 
competitive antagonist of DHFR. An appropriate host cell in this case is the Chinese hamster 
ovary (CHO) eel! line deficient in DHFR activity, prepared and propagated as described by 
' ■ Uriaub and Chasin, 1980, Proc. Nat. Acad. Sci. 77:4216. A particularly useful DHFR* is a 

10 mutant DHFR that is highly resistant to Mtx (EP 1 17,060A). T:,e transformed cells then are 
exposed to increased levels of Mtx. This leads to the synthesis of multiple copies of the 
DHFR gene and, concomitantly, multiple copies of other DNA comprising the expression 
vectors, such as the DNA encoding NT-4. Alternatively, host cells transformed by an 
expression vector comprising DNA sequences encoding NT-4, DHFR protein, and 

15 aminoglycoside 3' phosphotransferase (APH) can be selected by cell growth in medium 
containing an aminoglycosidic antibiotic such as kanamycin or neomycin or G418. Because 
eukaryotic cells do not normally express an endogenous APH activity, genes encoding APH 
protein, commonly referred to as neo r genes, may be used as dominant selectable markers 
in a wide range of eukaryotic host cells, by which cells transformed by the vector can readily 

20 be identified. 

Other methods, vectors and host cells suitable for adaptation to the synthesis of NT-4 
in recombinant vertebrate cell culture are described in Gething, et al., 1981, Nature 
293 :620: Mantei, et al., 1979, Nature 281:40; and Levinson, et al., EP 117,060A and 
117,058A. A particularly useful piasmid for mammalian cell culture expression of NT-4 is 
25 pRK5 (EP Pub. No. 307,247) or pSVI6B (PCT Pub. No. WO90/08291, published 6/13/91 ). 

Expression vectors, unlike cloning vectors, should contain a promoter which is 
recognized by the host organism and is operably linked to the NT-4 nucleic acid. Promoters 
are untranslated sequences located upstream from the start codon of a structural gene 
(generally within about 100 to 1000 bp) that control the transcription and translation of 
30 nucleic acid under their control. They typically fall into two classes, inducible and 
constitutive. Inducible promoters are promoters that initiate increased levels of transcription 
from DNA under their control in response to some change in culture conditions, e.g. the 
presence or absence of a nutrient or a change in temperature. At this time a large number 
of promoters recognized by a variety of potential host cells are well known. These promoters 
35 are operably linked to NT-4-encoding DNA by removing them from their gene of origin by 
restriction enzyme digestion, followed by insertion 5' to the start codon for NT-4. This is not 
to say that the genomic NT-4 promoter is not usable. However, heterologous promoters 
generally will result in greater transcription and higher yields of expressed NT-4. 
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Nucleic acid is operabiy linked when it is placed into a functidnal relationship with 
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 
operabiy linked to DNA for a polypeptide if it is expressed as a preprotein which participates 
in the secretion of the polypeptide; a promoter or enhancer is operabiy linked to a coding 
5 sequence if it affects the transcription of the sequence; or a ribosome binding site is operabiy 
linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
operabiy linked means that the DNA sequences being linked are contiguous and, in the case 
of a secretory leader, contiguous and in reading phase. Linking is accomplished by ligation 
at convenient restriction sites, if such sites do. not exist then synthetic oligonucleotide 

10 adaptors or linkers are used in accord with conventional practice. 

Promoters suitable for use with prokaryotic hosts include the ^-lactamase and lactose 
promoter systems (Chang, et al., 1978, Nature 275 :615; Goeddel, et al., 1979, Nature 
281:544), alkaline phosphatase, a tryptophan (trp) promoter system (Goeddel, 1 980, Nucleic 
Acids Res. 8:4057 and EPO Appln. Publ. No. 36,776) and hybrid promoters such as the tac 

15 promoter (H. de Boer, et al., 1983, Proc. Nat'I. Acad. Sci. 80:21). However, other known 
bacterial promoters are suitable. Their nucleotide sequences have been published, thereby 
enabling a skilled worker operabiy to ligate them to DNA encoding NT-4 (Siebenlist, et al., 
1980, Cell 2Q:269) using linkers or adaptors to supply any required restriction sites. 
Promoters for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence 

20 operabiy linked to the DNA encoding NT-4. 

Suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphogiycerate kinase (Hitzeman, et al., 1980, J. Biol. Chem. 25^:2073) or other 
glycolytic enzymes (Hess, et al., 1968, J. Adv. Enzyme Reg. 7:149; Holland, 1978, 
Biochemistry 17:4900), such as enoiase, glyceraldehyde-3-phosphate dehydrogenase, 

25 hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 
3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage 
of transcription controlled by growth conditions, are the promoter regions for alcohol 

30 dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with 
nitrogen metabolism, metallothionein, g!yceraldehyde-3-phosphate dehydrogenase, and 
enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters 
for use in yeast expression are further described in R. Hitzeman, eta 1„ EP 73,657A. Yeast 
enhancers also are advantageously used with yeast promoters. 

35 Transcription of NT-4-encoding DNA in mammalian host cells is controlled by promoters 

obtained from the genomes of viruses such as polyoma, cytomegalovirus, adenovirus, 
retroviruses, hepatitis-B virus and most preferably Simian Virus 40 (SV40), or from 
heterologous mammalian promoters, e.g. the actin promoter. The early and late promoters 
of the SV40 virus are conveniently obtained as an SV40 restriction fragment which also 
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contains the SV40 viral origin of replication (Rers, eta I., 1 978, Nature 1273 : 113). Of course, 
promoters from the host cell or related species also are useful herein. 

Transcription of NT-4-encoding DNA in mammalian host cells may be increased by 
inserting an enhancer sequence into the vector. An enhancer is a nucleotide sequence, 
5 usually about from 10-300 bp, that acts on a promoter to increase its transcription and does 
so in a manner that is 'relatively orientation and position independent. Many enhancer 
sequences are now known from mammalian genes (globin, elastase, albumin, a-fetoprotein 
and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. 
Examples include the SV40 enhancer on the late side of the replication origin (bp 100-270), 
10 the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the 
replication origin, and adenoviral enhancers. The enhancer may be spliced into the vector at 
a position 5' or 3' to the NT-4-encoding sequence, but is preferably located at a site 5' from 
the promoter. ' ' 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, 
15 human or nucleated cells from other multicellular organisms) will also contain sequences 
necessary for the termination of transcription and for stabilizing the mRNA. Such sequences 
are commonly available from the 5' and, occasionally 3' untranslated regions of eukaryotic 
or viral DNAs or cDNAs. These regions contain regions that are transcribed as polyadenylated 
segments in the untranslated portion of the mRNA encoding NT-4. The 3' untranslated 
20 regions also include transcription termination sites. 

Suitable host cells for cloning or expressing the vectors herein are the prokaryote, 
yeast or higher eukaryote cells described above. Suitable prokaryotes include gram negative 
or gram positive organisms, for example E. coli or bacilli. A preferred cloning host is E. coli 
294 (ATCC 31,446) although other gram negative or gram positive prokaryotes such as E. 
25 coli B, E. cpji X1 776 (ATCC 31 ,537), E. cpji W31 1 0 (ATCC 27,325), pseudomonas species, 
or Serrat'ia marcesans are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are 
suitable hosts for NT-4-encoding vectors. Saccharomvces cerevisiae , or common baker's 
yeast, is the most commonly used among lower eukaryotic host microorganisms. However, 
30 a number of other genera, species and strains are commonly available and useful herein. 

Suitable host cells for the expression of NT-4 are derived from multicellular organisms. 
Such host cells are capable of complex processing and glycor^ation activities. In principle, 
any higher eukaryotic cell culture is workable, whether frcm vertebrate or invertebrate 
culture, although cells from mammals such as humans are preferred. Propagation of such 
35 cells in culture is per se well known. (Tissue Culture , 1973, Kruse and Patterson, Eds., 
Academic Press, New York). Examples of useful mammalian host cell lines are VERO and 
HeLa cells, Chinese hamster ovary cell lines, the WI38, BHK, COS-7, MDCK cell lines and 
human embryonic kidney cell line 293. 
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Host cells are transformed with the above-described expression or cloning vectors and 
• cultured in conventional nutrient media modified as is appropriate for inducing promoters or 
selecting transformants containing amplified genes. The culture conditions, such as 
temperature, pH and the like, suitably are those previously used with the host cell selected 
5 for cloning or expression, as the case may be, and will be apparent to the ordinary artisan. 

NT-4 preferably is recovered from the, culture medium as a secreted protein, although 
it also may be recovered from host cell lysates when directly expressed without a secretory 
signal. When NT-4 is expressed in a recombinant cell other than one of human origin, the NT- 
4 is thus completely free of proteins of human origin. However, it is necessary to purify NT-4 

TO from recombinant cell proteins in order to obtain preparations that are substantially 
homogeneous as to protein. As a first step, the culture medium or lysate is centrifuged to 
remove particulate cell debris. NT-4 thereafter is purified from contaminant soluble proteins, 
for example, by fractionation on immunoaffinity or ion exchange columns; ethanol 
precipitation; reverse phase HPLC; chromatography on silica or on a cation exchange resin 

15 such as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; or gel 
electrophoresis using, for example, Sephadex G-75. faT-4 variants in which residues have 
been deleted, inserted or substituted relative to native NT-4 are recovered in the same fashion 
as native NT-4, taking account of any substantial changes in properties occasioned by the 
variation. For example, preparation of an NT-4 fusion with another protein, e.g. a bacterial 

20 or viral antigen, facilitates purification because an immunoaffinity column containing antibody 
to the antigen can be used to adsorb the fusion protein. A protease inhibitor such as phenyl 
methyl sulfonyl fluoride (PMSF) may be useful to inhibit proteolytic degradation during 
purification, and antibiotics may be included to prevent the growth of adventitious 
contaminants. One skilled in the art will appreciate that purification methods suitable for 

25 native NT-4 may require modification to account for changes in the character of NT-4 or its 
variants upon expression in recombinant cell culture. 

Peptide fragments of NT-4 and modified forms of NT-4 also are included within the 
scope of this invention. Peptide fragments having up to about 40 amino residues may be 
conveniently prepared by in vitro synthesis. 

30 Covalent modifications are made by reacting targeted amino acid residues of an NT-4 

polypeptide or peptide fragment with an organic derivatizing agent that is capable of reacting 
with selected side chains or the N~ or C-terminal residues. 

Cysteinyl residues most commonly are reacted with o-haloacetates (and corresponding 
amines), such as chioroacetic acid or chloroacetamide, to give carboxymethyl or 

35 carboxyamidomethyl derivatives. Cysteinyl residues also are derivatized by reaction with 
bromotrifluoroacetone, a-bromo-£-{5-imidozoyl)propionic acid, chloroacetyl phosphate, N- 
alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide, p- 
chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-1 ,3- 
diazole. 
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Histidyl residues are derivatized by reaction with diethyfpyrocarbonate at pH 5.5-7.0 
because this agent is relatively specific for the histidyl side chain. Para-bromophenacyl 
bromide also is useful; the reaction is preferably performed in 0.1 M sodium cacodyiate at pH 
6.0. 

5 Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid 

anhydrides. Derivatization with these agents has the effect of reversing the charge of the 
lysinyl residues. Other suitable reagents for derivatizing 0-amino-containing residues include 
imidoesters such as methyl picoiinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride; 
trinitrobenzenesulfonicacid; O-methylisourea; 2,4-pentanedione; andtransarninase-catal^zed 
10 reaction with glyoxylate. 

Arginyl residues are modified by reaction with one or several conventional reagents, 
among them phenylglyoxal, 2,3-butanedione, 1 ,2-cyclohexanedione, and ninhydrin. 
Derivatization of arginine residues requires that the reaction be performed in alkaline 
conditions because of the high pK, of the guanidine functional group. Furthermore, these 
15 reagents may react with the groups of lysine as well as the arginine epsilon-amino group. 

The specific modification of tyrosyl residues may be made, with particular interest in 
introducing spectral labels intc tyrosyl residues by reaction with aromatic diazonium 
compounds or tetranitromethane. Most commonly, N-acetylimidizole and tetranitromethane 
are used to form O-acetyl tyrosyl species and 3-nitro derivatives, respectively. Tyrosyl 
20 residues are iodinated using 12B I or 131 l to prepare labeled proteins for use in 
radioimmunoassay, the chloramlne T method described above being suitable. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with 
carbodiimides (R'-N = C-N-R') such as 1~cyclobexy)-3-f2-morpholinyl-4-ethyl) carbodiimide 
or 1-ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl 
25 residues are converted to asparaginy! and glutaminyl residues by reaction with ammonium 
ions. 

Derivatization with bifunctional agents is useful for crosslinking NT-4 to a water- 
insoluble support matrix or surface for use in the method for purifying anti-NT-4 antibodies, 
and vice versa. Commonly used crosslinking agents include, e.g., 1,1-bis(diazoacetyf)-2- 

30 phenyiethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-azido- 
salicyiic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succinimidylpropionate), and bifunctional maleimides such as bis-N-maleimido-1 ,8- 
octane. Derivatizing agents such as methyl-3-[(p-azidophenyI)dith»o]propioimidate yield 
photoactivatable intermediates that are capable of forming crosslinks in the presence of light. 

35 Alternatively, reactive water-insoluble matrices such as cyanogen bromide-activated 
carbohydrates and the reactive substrates described in U.S. Pat. Nos. 3,969,287; 3,691 ,01 6; 
4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein immobilization. 
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Glutaminyl and asfraraginyl residues are frequently deamidated to the corresponding 
glutamyl and aspartyl residues. Alternatively, these, residues are deamidated under mildly 
acidic conditions. Either form of these residues falls within the scope of this invention. 

Other modifications include hydroxylation of proline and lysine, phosphorylation of 
5 hydroxy! groups of seryl or threonyl residues, methylation of the a-amino groups of lysine, 
arginine, and histidine side chains (Creighton, Proteins: Structure and Molecular Properties. 
W.H. Freeman & Co., San Francisco, pp. 79-86), acetylation of the N-terminat amine, and 
amiddtion of any C-terminal carboxyl group. NT-4 also is covalently linked to 
nonproteinaceous polymers, e.g. polyethylene glycol, polypropylene glycol or 
1 0 polyoxyalkylenes, in the manner set forth in U.S. Pat. App. No. 07/275,296 or U.S. Pat. Nos. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

NT-4 in purified form, that is, in a form where the NT-4 is substantially free of other 
polypeptides or peptides, imay be entrapped in microcapsules prepared, for example, by 
coacervation techniques or by interfacial polymerization (for example, hydroxymethylcellulose 
15 or gelatin-microcapsules and poly-(methylmethacylate) microcapsules, respectively), in 
colloidal drug delivery systems (for example, liposomes, albumin microspheres, 
microemulsions, nano-particles and nanocapsules) or in macroemulsions. Such techniques 
are disclosed in Remington's Pharmaceutical Sciences , 16th edition, 1980, 
(A. Osol, Ed). 

20 NT-4 is believed to find use as an agent for enhancing the survival or inducing the 

outgrowth of nerve cells. It, therefore, is useful in the therapy of degenerative disorders of 
the nervous system ("neurodegenerative diseases"), including such diseases as Alzheimer's 
disease, Parkinson's disease, Huntington's chorea, ALS, peripheral neuropathies, and other 
conditions characterized by necrosis or loss of neurons, whether central, peripheral, or 

25 motorneurons. In addition, it may be useful for treating damaged nerve cells, e.g., nerves 
damaged by traumatic conditions such as burns and wounds, diabetes, kidney dysfunction, 
and the toxic effects of chemotherapeutics used to treat cancer and AIDS. It also is useful 
as a component of culture media for use in culturing nerve cells in vitro . Finally, NT-4 
preparations are useful as standards in assays for NT-4 and in competitive-type receptor 

30 binding assays when labelled with radioiodine, enzymes, fluorophores, spin labels, and the 
like. 

Therapeutic formulations of NT-4 are prepared for storage by mixing NT-4 having the 
desired degree of purity with optional physiologically acceptable carriers, excipients or 
stabilizers ( Remington's Pharmaceutical Sciences , supra ), in the form of lyophiiized cake or 
35 aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at 
the dosages and concentrations employed, and include buffers such as phosphate, citrate and 
other organic acids; antioxidants including ascorbic acid; low molecular weight (less than 
about 10 residues) polypeptides; proteins, such as serum albumin, gelatin or 
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino acids such as 
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glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar 
alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or 
nonionic surfactants such as Tween, Pluronics or PEG. 
5 NT-4 to be used for in vivo administration must be sterile. This is readily accomplished 

by filtration through sterile filtration membranes, prior to or following Jyophilization and 
reconstitution. NT-4 ordinarily will be stored in lyophilized form. 

Therapeutic NT-4 compositions generally are placed into a container having a sterile 
' . access port, for example, an intravenous solution bag or vial having a stopper pierceable by 
10 a hypodermic injection needle. 

NT-4 optionally is combined with or administered in concert with other neurotrophic 
factors including NGF, NT-3, and/or BDNF and is used with other conventional therapies for 
degenerative nervous disorders. 

The route of NT-4 or NT-4 antibody administration is in accord with known methods, 
15 e.g. injection or infusion by intravenous, intraperitpneaf, intracerebral, intramuscular, 
intraocular, intraarterial or intralesional routes, topical administration, or by sustained release 
systems as noted below. NT-4 is administered continuously by infusion into the fluid 
reservoirs of the CNS, although bolus injection is acceptable. NT-4 preferably is administered 
into the ventricles of the brain or otherwise introduced into the CNS or spinal fluid. It should 
20 be administered bv an indwelling catheter using a continuous administration means such as 
a pump, or it can be administered by implantation, e.g., intracerebral implantation, of a 
sustained-release vehicle. More specifically, NT-4 can be injected through chronically 
implanted cannulas or chronically infused with the help of osmotic minipumps. Subcutaneous 
pumps are available that deliver proteins through a small tubing to the cerebral ventricles. 
25 Highly sophisticated pumps can be refilled through the skin and their delivery rate can be set 
without surQical intervention. Examples of suitable administration protocols and delivery 
systems involving a subcutaneous pump device or continuous intracerebroventricular infusion 
through a totally implanted drug delivery system are those used for the administration of 
dopamine, dopamine agonists, and cholinergic agonists to Alzheimer patients and animal 
30 models for Parkinson's disease described by Harbaugh, 1987, J. Neural Transm. Suppl., 
. 24-271; and DeYebenes, et al., 1987, Mov. Disord. 2:143. NT-4 antibody is administered 
in the same fashion, or by administration into the blood stream or lymph. 

Suitable examples of sustained release preparations include semipermeable polymer 
matrices in the form of shaped articles, e.g. films, or microcapsules. Sustained release 
35 matrices include polyesters, hydrogels, polylactides {U.S. Pat. No. 3,773,919, EP 58,481), 
copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman, et al., 1983, 
Biopolymers £2:5471, poly (2-hydroxyethyI-methacrylate) (Langer, et ai., 1981, J. Biomed. 
Mater. Res. 15:167; Langer, 1982, Chem. Tech. 12:98), ethylene vinyl acetate (Langer, et 
al, Id.) or poly-D-(-)-3-bydroxybutyric acid [EP 133,988A). Sustained release NT-4 
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compositions also include liposomally entrapped NT-4. Liposomes 'containing NT-4 are 
prepared by methods known rjer se. (Epstein, et al., 1985, Proc. Natl. Acad. Sci. 82:3688; 
Hwang, et al., 1980,Proc. Natl. Acad. Sci. USA 77:4030; DE 3,218,121A; EP 52322A; 
EP36676A; EP 88046A; EP143949A; EP 1 42641 A; Japanese Pat. App. No. 83-1 1 8008; 
5 U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324A). Ordinarily the liposomes are 
of the small (about 200-800 Angstroms) unilamelar type in which' the lipid content is Greater 
than about 30 mol. % cholesterol, the selected proportion being adjusted for the optimal NT-4 
therapy. 

An effective amount of NT-4 to be employed therapeutically will depend, for example, 
1 0 upon the therapeutic objectives, the route of administration, and the condition of the, patient. 

Accordingly, it will be necessary for the therapist to titer the dosage and modify the route of 

i 

administration as required to obtain the .optimal therapeutic effect. A typical daily dosage 
might range from about \ //g/kg to up to 100 mg/kg or more, depending on the factors 

mentioned above. Typically, the clinician will administer Nt-4 until a dosage is reached that 

i 

1 5 repairs, maintains, and, optimally, reestablishes neuron function. The progress of this therapy 
is easily monitored by conventional assays. 

Polyclonal antibodies to NT-4 generally are raised in animals by multiple subcutaneous 
(sc) or intraperitoneal (ip) injections of NT-4 and an adjuvant. It may be useful to conjugate 
NT-4 or a fragment containing the target amino acid sequence to a protein which is 

20 immunogenic in the species to be immunized, e.g., keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, or soybean trypsin inhibitor using a bifunctional or derivatizing 
agent, for example, maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine 
residues), N-hydroxysuccintmide (through lysine residues), glutaraldehyde, succinic anhydride, 
SOCl 2 , or R 1 N - C - NR. 

25 Animals are immunized against the immunogenic conjugates or derivatives by 

combining 1 mg or 1 //g of conjugate (for rabbits or mice, respectively) with 3 volumes of 
Freund's complete adjuvant and injecting the solution intradermal^ at multiple sites. One 
month later the animals are boosted with 1/5 to 1/10 the original amount of conjugate in 
Freund's complete adjuvant by subcutaneous injection at multiple sites. 7 to 14 days later 

30 animals are bled and the serum is assayed for anti-NT-4 titer. Animals are boosted until the 
titer plateaus. Preferably, the animal is boosted with the conjugate of the same NT-4 
polypeptide, but conjugated to a different protein and/or through a different cross-linking 
agent. Conjugates also can be made in recombinant cell culture as protein fusions. Also, 
aggregating agents such as alum are used to enhance the immune response. 

35 Monoclonal antibodies are prepared by recovering spleen cells from immunized animals 

and immortalizing the cells in conventional fashion, e.g. by fusion with myeloma cells or by 
EB virus transformation and screening for clones expressing the desired antibody. 

NT-4 antibodies are useful in diagnostic assays for NT-4 or its antibodies. The 
antibodies are labelled in the same fashion as NT-4 described above and/or are immobilized 
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on an insoluble matrix. In one embodiment of a receptor binding assay, an antibody 
composition which binds to all or a selected plurality of members of the NT-4 family is 
immobilized on an insoluble matrix, the test sample is contacted with the immobilized 
antibody composition in order to adsorb all NT-4 family members, and then the immobilized 
5 family members are contacted with a plurality of antibodies specific for each member, each 
of the antibodies being individually identifiable as specific for a predetermined family member, 
as by unique labels such as discrete fluorophores or the like. By determining the presence 
and/or amount of each unique label, the relative proportion and amount of each family 
member can be determined. 
10 NT-4 antibodies also are useful for the affinity purification of NT-4 from recombinant 

cell culture or natural sources. NT-4 antibodies that do not detectably cross-react with NGF, 
NT-3, or BDNF can be used to purify NT-4 free from these other family members. 

Suitable diagnostic assays for NT-4 and its antibodies are well known rier se. In 
addition to the bioassay described above, competitive, sandwich and steric inhibition 
1 5 immunoassay techniques are useful. The competitive apd sandwich methods employ a phase 
separation step as an integral part of the method while steric inhibition assays are conducted 
in a single reaction mixture. Fundamentally, the same procedures are used for the assay of 
NT-4 and for substances that bind NT-4, although certain methods will be favored depending 
upon the molecular weight of the substance being assayed. Therefore, the substance to be 
20 tested is referred to herein as an analyte, irrespective of its status otherwise as an antigen 
or antibody, and proteins which bind to the analyte are denominated binding partners, 
whether they be antibodies, cell surface receptors or antigens. 

Analytical methods for NT-4 or its antibodies all use one or more of the following 
reagents: labelled analyte analogue, immobilized analyte analogue, labelled binding partner, 
25 immobilized binding partner and steric conjugates. The labelled reagents also are known as 
"tracers". 

The label used is any detectable functionality which does not interfere with the binding 
of analyte and its binding partner. Numerous labels are known for use in immunoassay, 
examples including enzymes such as horseradish peroxidase, radioisotopes such as 14 C and 

30 131 l, fluorophores such as rare earth chelates or fluorescein, stable free radicals and the like. 
Conventional methods are available to covalently bind these labels to proteins or 
polypeptides. Such bonding methods are suitable for use with NT-4 or its antibodies, all of 
which are proteinaceous. 

Immobilization of reagents is required for certain assay methods. Immobilization entails 

35 separating the binding partner from any analyte which remains free in solution. This 
conventionally is accomplished by either insolubilizing the binding partner or analyte analogue 
before the assay procedure, as by adsorption to a water insoluble matrix or surface (Bennich, 
et al. f U.S. Pat. No. 3,720,760}, by covalent coupling (for example using glutaraldehyde 
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cross-Nnking), or' by insolubilizing the partner or analogue afterward, e.g., by 
immurroprecipitation. 

Other assay methods, known as competitive or sandwich assays, are well established 
and widely used in the commercial diagnostics industry. 
5 Competitive assays rely on the ability of a labelled analogue (the "tracer") to compete 

with the test sample anaiyte for a limited number of binding sites on a common binding 
partner. The. binding partner generally is insolubilized before or after the competition and then 
the tracer and anaiyte bound to the binding partner are separated from the unbound tracer 
and anaiyte. This separation is accomplished by decanting (where the binding partner was 

10 preinsolubilized) or by centrifuging (where the binding partner was precipitated after the 
competitive reaction). The amount of test sample anaiyte is inversely proportional to the 
amount of bound tracer as measured by the amount of marker substance. Dose-response 
curves with known amounts of anaiyte are prepared and compared with the test results in 
order to quantitatively determine the amount of anaiyte present in the test sample. These 

15 assays are called ELISA systems when enzymes are used as the detectable markers. 

Another species of competitive assay, called a "homogeneous" assay, does not require 
a phase separation. Here, a conjugate of an enzyme with the anaiyte is prepared and used 
such that when anti-analyte binds to the anaiyte the presence of the anti-analyte modifies the 
enzyme activity. In this case, NT-4 or its immunologically active fragments are conjugated 

20 with a bifunctional organic bridge to an enzyme such as peroxidase. Conjugates are selected 
for use with anti-NT-4 so that binding of the anti-NT-4 inhibits or potentiates the enzyme 
activity of the label. This method per se is widely practiced under the name of EMIT. 

Steric conjugates are used in steric hindrance methods for homogeneous assay. These 
conjugates are synthesized by covalently linking a low molecular weight hapten to a small 

25 anaiyte so that antibody to hapten substantially is unable to bind the conjugate at the same 
time as anti-analyte. Under this assay procedure the anaiyte present in the test sample will 
bind anti-analyte, thereby allowing anti-hapten to bind the conjugate, resulting in a change 
in the character of the conjugate hapten, e.g., a change in fluorescence when the hapten is 
a fluorophore. 

30 Sandwich assays particularly are useful for the determination of NT-4 or NT-4 

antibodies. In sequential sandwich assays an immobilized binding partner is used to adsorb 
test sample anaiyte, the test sample is removed as by washing, the bound anaiyte is used to 
adsorb labelled binding partner and bound material then separated from residual tracer. The 
amount of bound tracer is directly proportional to test sample anaiyte. In "simultaneous" 

35 sandwich assays test sample is not separated before adding the labelled binding partner. A 
sequential sandwich assay using an anti-NT-4 monoclonal antibody as one antibody and a 
polyclonal anti-NT-4 antibody as the other is useful in testing samples for NT-4 activity. 
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The foregoing are 'merely exemplary diagnostic assays for NT-4 and antibodies. Other 
methods now or hereafter developed for the determination of these analytes are included 
within the scope hereof, including the bioassay described above. 

The following examples are offered by way of illustration and not by way of limitation. 
5 EXAMPLE I 

Attempts to identity and isolate DNA encoding NT-4 from human genomic and cDNA 
libraries using NGF and BDNF probes were unsuccessful. Instead, to identify the NT-4 gene, 
it was necessary to amplify, human genomic DNA using the polymerase chain reaction (PCR) 
(Mullis, et al., 1987, Cold Spring Harbor Symp. Quant. Biol. 51-263). Human genomic 
10 placenta! DNA (prepared as described in Maniatis, et al., 1982, Molecular Cloning: A 
Laboratory Manual , Cold Spring Harbor Laboratory, New Yor^f, in the section on preparing a 
genomic DNA library) was employed as template for the above-identified primers, since the 
active forms of NGF, BDNF, and NT-3 are encoded by a single exon {Leibrock, et al., supra : 
Hohn, et al., supra ; Maisonpierre, et al., supra ; Rosenthal, et al., supra ). 
1 5 Amino acid sequences for NGF, BDNF, and NT-3 were scanned for regions of common 

homology. A number of these regions were identified and single stranded primer pools 
containing restriction sites for £aj, Xba, and EcoRI were prepared that were complementary 
to all possible sequences of DNA for the plus and minus strands of the selected NGF, BDNF, 
and NT-3 sequences. The primer pool for the sense strand corresponded to residues 50 - 58 
20 of (mature human £NGF) NGF, designated NGX-54. The sense primer comprised the 
following sequence of alternatives {SEQ ID NOS. 93, 94, 95 and 96, respectively): 

5 1 -CCGCGCGCTCTAGAGTCGACAAGCAGTACTTCTATGAGACGAAGTGT— 3 1 

A A T T TC A CCGA C 
25 T 

A 



The primer pool for the antisense strand corresponded to residues 102 - 110 of NGF 
30 (designated AR1 ) and comprised the following sequence of alternatives (SEQ ID NOS. 97, 98, 
99 and 100, respectively): 

5 ■ -CGGCTCAGGGCCGAATTCGCACACGCAGGAAGTATCTATCCTTAT-3 • 

ATAACG GAT GG 
35 G T G G A 

A 

Note that each primer sequence has a restriction site at its 5' end in order to facilitate cloning 
40 the amplified sequences. Careful selection of amplification conditions allowed amplification 
of NT-4 sequence despite the fact that these pools were considerably larger than the 
conventional pools used heretofore for shorter amino acid sequences (ranging from 32 to 
32,000 fold degeneracy. (Lee, et al., 1988, Science 239:1 288; Strathmann, et al., 1989, 
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Proc. Nat. Acad. Sci. £g:7407; Leibrock, et a!', supra ). The primers were employed to 
prepare amplified DNA which was then sequenced. The conditions for amplification were as 
follows: 

!. PCR with Human genomic placental DNA 

5 denat. 95°C 5' once initially 
denat. 95°C V } 
anneal 55°C V } 45 cycles 
extens. 72°C 1 ' } 

10 extens. 72°C 15' 

t 

i 

10 //II Ox buffer (final = 50 mM KCI, 10 mM Tris pH 8.4, 3.0 mM MgCI 2 ) 
3 p\ human genomic DNA (3 //g) 

7.5 r\glfj\ primer (approx. 1 fig « ~ 2.6 jjM of 33 mer, 
15 therefore 10 3 degen = nM, 10 6 ~ pM) 

7.5 ng///l primer 

10 /yllOx dNTPs (final - 0.2 mM dNTPs) 
1 jj\ Taq polymerase 

61 u\ dH 7 0 

20 ^00p\V 1 

II. Cut with £a]l and Eco Rl, generate and gel purify fragments of the expected size, about 
210 bp, and subclone into the M13-based vector, M13mp18 (Pharmacia). 

NGF, BDNF, and NT-3 clones were identified by hybridization with oligonucleotides 

25 derived from unique regions of their respective cDNA sequences. Plasmids containing non- 
hybridizing inserts were sequenced (Smith, 1 980, Meth. Enzymol. 65:560) and their potential 
translation products were analyzed for homology with NGF, BDNF, and NT-3. 

This procedure revealed the presence of about 500 NGF, BDNF, and NT-3 clones, and 
78 unrelated clones. In addition, three DNA fragments encoding part of a novel NGF-related 

30 factor were identified and collectively designated NT-4. The low abundance of NT-4 clones 
generated by PCR was caused by the poor homology between its DNA sequence and the PCR 
primers. 

Screening of a human fetal brain cDNA library (Rosenthal, et al., 1987, EMBO J. 
£:3641) using the genomic placental clone as a probe did not yield any positive clones. To 
35 obtain a complete human NT-4 homolog, a human genomic library was also screened 
(Maniatis, et al„ 1 978, Cell 15:687) and a 6-kb DNA fragment was isolated. This fragment 
was found to contain a single open reading frame encoding a polypeptide of 168 amino acids 
encompassing the NT-4 mature polypeptide. 

The full nucleotide sequence and deduced amino acid sequence of human mature NT-4 
40 and at least a portion of its precursor region is shown in Figure 1 . The entire precursor 
region, including the signal sequence, may be as depicted between the initiating methionine 
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shown and the last Arg of 'the cleavage site before the mature sequence 'begins. If this is the 
case, the precursor region of NT-4 is much shorter than the precursor regions of NGF, BDNF, 
and NT-3, shown in Figure 2. Assignment of the initiation codon for NT-4 was made based 
on the location of the initiation codon in NGF, BDNF, and NT-3. The amino acid sequence 
5 of mature human NT-4 has approximately 46%, 55%, and 52% sequence homology (identity) 
to mature human NGF, ^DNF, and NT-3, respectively, based on the alignment of the 
sequences as shown in Figure 2. 

The active, mature forms of NGF, BDNF, and NT-3 are homodimers of 13-14 kD 
proteins that are generated from their ca. 30 kD precursors (Leibrock, et al., supra ; 
10 Maisonpierre, et al., supra ; Hohn, et al., supra ; Greene and Shooter, 1980, Ann Rev. 
Neurosci. 3:353). The NT-4 precursor protein sequence also showed a potential tetrabasic 
cleavage site before the mature region begins, indicating that all four members of this protein 
family may be similarly processed. Processing at this site would result in a 13.14 kD (130 
amino acid) polypeptide. 

1 5 To assess the possible function of NT-4, its tissue distribution was determined by 

Northern blot analysis. In the rat, NT-4 mRNA was found in varying levels in every tissue 
examined, i.e., heart, muscle, kidney, liver, spleen, gut, lung, and spinal cord, and in several 
brain regions, including cerebellum and cortex. This broad organ localization of NT-4 mRNA 
suggested that in the peripheral nervous system, NT-4 could serve as a target-derived trophic 

20 factor for sympathetic, sensory, and/or motor neurons. This theory is tested by expressing 
DNA encoding recombinant human NT-4 and assaying its various activities. 

EXAMPLE II 

The following protocol for expressing NT-4 DNA and purifying the resultant NT-4 is 
expected to provide sufficient NT-4 for assay purposes. This example also provides expected 
25 assays to be employed to test the purified NT-4 and compare it to NGF. 

A cytomegalovirus-baseci expression vector called pRK5, described in Gorman, et al., 
1990, DNA and Protein Engineering Techniques 2:1 and in EP Pub. No. 307,247, published 
15 March 1989, is employed as the expression vector. The NT-4 genomic DNA is cut from 
the phage in which it was cloned. This DNA fragment is then ligated into pRK5 previously 
30 cut with the appropriate restriction enzymes to accommodate the DNA fragment using 
standard ligation methodology (Maniatis et al., 1982, Molecular Cloning: A Laboratory 
Manual . Cold Spring Harbor Laboratory, New York). The resulting vector is called pRK-5hNT- 
4. 

A human embryonal kidney 293 cell line (Graham, et al., 1977, J. Gen. Virol. 36:59) 
35 is grown to confluence. Ten /yg of the NT-4 plasmid DNA (pRK-5hNT-4) is mixed with 1 jjq 
of DNA encoding the VA RNA gene (Thimmappaya, et aL, 1982, Cell 31:543) and dissolved 
in 500 p\ of 1 mM Tris-HCI, 0.1 mM EDTA, 0.227 M CaCI 2 . Added to this (dropwise while 
vortexing) is 500 fj\ of 50 mM HEPES (pH 7.35), 280 mM NaCI, 1.5 mM NaP0 4 , and the 
precipitate is allowed to form for 10 mm. at 25 °C. The suspended precipitate is then added 
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to the cells (in 100 mM plate) and allowed to settle for four hours in the incubator. The 
medium is then aspirated off and 2 ml of 20% glycerol in. phosphate-buff ered saline is added 
for 30 sec. The cells are washed twice with 5 ml of serum-free medium, then fresh medium 
is added, and the cells are incubated for five days. 
5 The 293 cells are also transfected in the same way with pRK5 alone. * 

Twenty-four hours after the transfections, the medium is replaced and cells are 
incubated for 12 hours in the presence of 200 /yCi/ml 36 S-cysteine and 200 /iCi 36 S- 
methionine. Conditioned medium is then collected, concentrated 5-fold by lyophilization, and 
loaded on a 15% SDS gel, which is subsequently enhanced, dried, and exposed to film'for 

10 two hours. These data are expected to indicate the presence of a polypeptide of 
approximately the expected size (14-15 kD). 

Large-scale expression of NT-4 is performed by transiently introducing by the dextran 
sulfate method (Sompayrac and Danna, 1981, Proc. Natl. Acad. Sci. 12:7575) 700 jjq of 
pRK-5hNT-4 into the human embryonal kidney 293 cell line grown to maximal density (1.5 

1 5 liters) in a 3-liter Belco microcarrier spinner flask. The cells are first concentrated from the 
spinner flask by centrifugation, and washed with phosphate-buffered saline (PBS), and the 
DNA-dextran precipitate is incubated on the cell pellet for four hours. The cells are treated 
with 20% glycerol for 90 seconds, washed with a medium such as 50:50 DMEM:F-1 2 
medium, and re-introduced into a 3-liter spinner flask containing 1 .5 liter of the above medium 

20 plus 5 //g/ml bovine insulin and 0.1 pQ/m\ bovine transferrin. The above protocol is performed 
for three separate 3-liter cultures. 

After 4 days approximately 5 liters of conditioned media from the large-scale 
expression described above is centrifuged and filtered to remove cells and debris, and 
concentrated 100-fotd. The buffer, salts, and other small molecules are exchanged by 

25 dialysis into 25 mM sodium borate, pH 9.0, and 4 M urea, and applied to a 5 cm. x 5 cm. 
DEAE Sepharose Fast-Row ion-exchange chromatography column (Pharmacia, Inc.). The pH 
of column effluent (495 ml) is neutralized (pH 7.0) by the addition of 0.1 volume of 250 mM 
3-{N-morphoiino)propanesulfonic acid (MOPS) buffer to give a final composition of 25 mM 
MOPS, pH 7.0, and 4 M urea. This sample is applied to a 2.5 cm. x 2.5 cm. S~Sepharose 

30 ion-exchange chromatography column (Pharmacia), washed, and eluted with 25 mM MOPS, 
pH 7.0, 4 M urea, and 0.5 M NaCI (40 ml). 

Two different assays indicate the presence of recombinant human NT-4 in the S- 
Sepharose salt eluant (130 ng/ml, 5 //g total): 1) 48-hour neuronal survival and neurite 
outgrowth in three types of chick embryonal peripheral ganglionic neurons: paravertebral 

35 sympathetic chain ganglion neurons, spinal sensory neurons of dorsal root ganglia 
(lumbosacral region), and nodose ganglion neurons, and 2) immunocrossreactivity in an ELISA 
assay (Lucas, et a!., 1 989, J. Endocrinol. 120:449) utilizing polyclonal antibodies to human 
/?-NGF, which can be generated as described above in the Description Section using fi-NGF 
as immunogen rather than NT-4. The S-Sepharose eluant isdialyzed into 1 M acetic acid and 
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4 M urea, concentrated 10-fold, applied to a S-300 Sephacryl gel-filtration column (1 .5 cm. 
x 44 cm.), and chromatographed in the same buffer. 

Aliquots of 200 v\ are taken from each 1 mi fraction collected, dialyzed against 1 M 
acetic acid, iyophilized, and redissolved in 30 //I Laemmli SDS-PAGE sample buffer (Laemmli, 
5 1970, Nature 227:680). Human /MNJGF is obtained in a similar manner. Following SDS- 
PAGE, the silver-stained gel indicates a single, prominently stained polypeptide of 
approximately 15 kD. A 3-ml pool of S-300 column eluted fractions corresponding to this 
SDS-PAGE analyzed region is made, and 1 ml (0,5 nmole) is submitted to N-terminal amino 
' acid sequence analysis by Edman degradation performed on a prototype automated amino 

1 0 acid sequencer (Kohr, EP Pat. Pub. No. 257,735). N-terminal sequence analysis gives a single 
sequence starting with a glycine residue predicted by the tetrabasic cleavage sequence 

i 

ending in an arginine, and predicted by the processing of preproNGF to mature /?-NGF. 

The initial sequencing cycles may be quantitated to indicate the amount of recovery 

of the purified human NT-4 from the three-column process. The purified recombinant human 
1 5 NT-4 is dialyzed into 0.1 % acetic acid to give a final concentration of 3.25 //g/ml. This stock 

material may be diluted into neuronal cell media (DMEM high glucose with 10% fetal bovine 

serum) at various concentrations from 4 to 60 ng/ml for carrying out various bioassays, 

For larger-scale production of NT-4, the preferred vector is a SV40-driven vector such 

as pSVI6B described above, the preferred host cells are Chinese hamster ovary cells, and the 
20 preferred culture medium is a DMEM or 1:1 DMEM:F12 medium with levels of glucose 

elevated to optimize product yield or the serum-free medium described in U.S. Pat. No. 

4,767,704. 

Purified NT-4 is analyzed for neurotrophic activities on several types of primary 
embryonal day-10 chick neurons as described by Davies, in Nerve Growth Factors , 1989, 

25 (R.A. Rush, Ed., John Wiley & Sons, Boston), pp. 95-109. Thus, paravertebral sympathetic 
chain ganglia (SG), dorsal root (lumbosacral) ganglia (DRG), and nodose ganglia (NG) are 
dissected from day-1 0 chick embryos. The neuronal cells are dispersed from the ganglia with 
trypsin or pancreatin (GIBCO) and preplated twice to reduce the number of non-neuronal cells. 
Cells are counted and seeded in a 96-well tissue culture plate that had been pretreated with 

30 polyornithine (500 //g/m!) and laminin (10/yg/ml). (Lindsay, et al., 1985, Dev. Biol. 1 12:319). 
The cell seeding numbers are SG and DRG, 4000 cells per well; NG, 2000 cells per well. 

Purified mouse submaxillary gland £-NGF used in the assays is obtained from 
Biomedical Technologies, Inc. and dissolved in 0.1% acetic acid to a concentration of 10 
//g/ml. Purified recombinant human NT-4 dialyzed into 0.1% acetic acid at a final 

35 concentration of 3.25 /yg/ml is used. Cells are incubated with or without the factors for 48 
hours and phase-bright cell bodies which had elaborated neurites 5x the length of the cell 
body are counted. Individual perikaryons can be counted in the cultures of DRG and NG 
neurons. However, the perikaryons of SG neurons aggregate and cell aggregates are scored. 
The cell survival at maximal response is approximately 20-40% for DRG and NG neurons, 
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whereas SG neurons are likely higher since aggregates are scored. Four experiments are 
carried out utilizing each of NGF and NT-4. 

NT-4 is expected to be most active on peripheral neurons. In vertebrates, peripheral 
neurons are derived from two distinct embryonic sources: the neural crest and the neural 
5 placodes (LeDouarin and Smith, 1 988, Ann. Rev. Cell Biol. 4:375). Neural crest-derived cells 
give rise to neurons and to the supporting cells of the peripheral nervous system and the 
placode-derived cells give rise to some sensory cells and cranial neurons. 

The neural crest-derived dorsal root sensory ganglia (DRG) cells project to the CNS and 
to peripheral tissues, and are dependent on neurotrophic factors derived from both targets. 
10 (Lindsay, et al., 1985, Dev. Biol. 1 1 2 :31 9). This dual dependency is a possible mechanism 
to ensure the survival only of neurons that form all the appropriate connections. Placode- 
derived nodose sensory ganglia (NG), whiph are also dually connected and respond to the 
CNS factor BDNF, do not respond to the peripherally derived trophic factor (NGF). Thus, 
peripheral target innervation by NG neurons fs likely to be ensured by an alternative 
15 mechanism or via other factors. 

The presence of NT-4 in the brain and the periphery suggests additional functions and 
raises the possibility that it could be valuable for treating diseases such as Alzheimer's, 
Parkinson's, or Huntington's chorea that are caused by brain neuron degeneration and/or 
treating damaged nerves due to trauma or preventing damage to peripheral nerve cells. NT-4 
20 could be tested for central neurological functions in an established animal lesion model such 
as that of Hefti, supra , or in aged rats or monkeys. 

EXAMPLE til 

To identify naturally occurring amino acid sequence variants of NT-4, the genomic DNA 
fragment described above, comprising the coding sequence for mature human NT-4, was 

25 used as a hybridization probe to screen for homologous DNAs in the human fetal brain cDNA 
library (Rosenthal, et al., 1987, EMBO J., 6:3641) and in a human lymphocyte genomic DNA 
library (Stratagene, La Jolla, CA). 

Hybridization and washing of filters containing the library DNAs were performed under 
high stringency conditions: Hybridization of radiolabeled NT-4 probe to the filters was 

30 performed in a solution of 50% formamide, 5x SSC (1x = 0.15 M NaCI, 0.015 M sodium 
citrate), 0.1% sodium dodecyl sulfate (SDS), 0.1% sodium pyrophosphate, 50mM sodium 
phosphate pH 6.8, 2x Denhardt's solution (1x = 0.02% Ficoll, 0.02% polyvinylpyrrolidone, 
0.02% bovine serum albumin), 10% dextran sulfate, at 42°C. for 20 hrs. Washing of the 
filters was performed in an aqueous solution of 0.1 x SSC, 0.1 % SDS at 42°C. 

35 Three DNAs were identified that had significant sequence homology with the DNA 

encoding mature human NT-4. The complete nucleotide sequences of those homologous 
DNAs is shown in Figures 3, 4, and 5, along with the deduced amino acid sequence of the 
polypeptides they encode, which polypeptides are referred to as NT-4£, NT-4p, and NT-4 A, 
respectively. The DNA encoding human NT-4/?, having the sequence shown in Figure 3, was 
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isolated from the human fetal brain cDNA library, the nucleotide sequence shown in Figure 

3 appears to encode a portion of human NT-4/S. A full length cDNA, encoding the entirety 
of human NT-4/?, is readily obtained by probing the human fetal brain cDNA library with the 
cDNA disclosed in Figure 3. The DNA encoding human NT-4p, having the sequence shown 

5 in Figure 4, was isolated from the human lymphocyte genomic DNA library. The DNA 
encoding human NT-4A, having the sequence shown in Figure 5, also was isolated from the 
human lymphocyte genomic DNA library. 

frgure 6 shows the homologies among the amino acid sequences of human NT-4, NT- 
4/3, NT-4p, and NT-4A. The amino acid sequence of human NT-4 has at least about 7.5% 
10 sequence homology (identity) to each of NT-4^ NT-4K/ and NT-4A, based on the alignment 
of the amino acid sequences as shown in Figure 6. As is apparept, NT-4/?, NT-4y, and NT-4A 
are amino acid sequence variants of human' NT-4, as defined herein, differing from human NT- 

4 by virtue of various amind acid insertions, and substitution 

Beceuse NT-4/?, NT-4^ and NT-4A are naturally occurring amino acid sequence 
15 variants of human NT-4, it is expecte,d that NT-4& NT-4k, anti NT-4A, like NT-4, have a role 
in regulating the normal growth and/or development of vertebrate neural tissue. NT-4/?, NT- 
4/, and NT-4A are readily produced by recombinant means by expression in a suitable host 
cell transformed with an expression vector comprising DNA encoding those polypeptides, as 
described above. NT-4# NT-4^, and NT-4A are analyzed for neurotrophic activities as 
20 described above for NT-4. 

In summary, NT-4 is a novel trophic factor with a broad tissue distribution. It 
complements NGF, BDNF, and NT-3, which are trophic factors for some peripheral neurons. 
NT-4/?, NT-4y, and NT-4 A are novel amino acid sequence variants of NT-4. Each of these 
factors can likely act alone or in concert on defined subsets of neurons to achieve the correct 
25 neuronal connections both in the peripheral and central nervous system. 
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SEQUENCE LISTING ' 

'(1) GENERAL INFORMATION: 

5 (i) APPLICANT: GENENTECH, INC, 

ROSENTHAL, ARNON 

(ii) TITLE OF INVENTION: NOVEL NEUROTROPHIC FACTOR 

10 (iii) NUMBER OF SEQUENCES: 100 , 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 
15 (C) CITY: South San Francisco 

' (D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

20 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS— DOS 

(D) SOFTWARE: pat in (Genentech) 

25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 24-SEP-1991 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

<A) APPLICATION NUMBER: 07/648482 
(B) APPLICATION DATE: 31-JAN-1991 

35 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/587707 

(B) APPLICATION DATE: 25-SEP-1990 

(viii) ATTORNEY /AGENT INFORMATION: 
40 (A) NAME: Hensley, Max D. 

(B) REGISTRATION NUMBER: 27,043 

(C) REFERENCE /DOCKET NUMBER: 666P2 

(ix) TELECOMMUNICATION INFORMATION: 
45 (A) TELEPHONE: 415/266-1994 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 



30 



50 



(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
60 ATGCTCCCTC TCCCCTCATG CTCCCTCCCC ATCCTCCTCC TTTTCCTCCT 50 

CCCCAGTGTG CCAATTGAGT CCCAACCCCC ACCCTCAACA TTGCCCCCTT 100 

65 

TTCTGGCCCC TGAGTGGGAC CTTCTCTCCC CCCGAGTAGT CCTGTCTAGG 150 
GGTGCCCCTG CTGGGCCCCC TCTGCTCTTC CTGCTGGAGG CTGGGGCCTT 200 
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TCGGGAGTCA GCAGGTGCCC CGGCCAACCG CAGCCGGCGT GGGGTGAGCG 250 
AAACTGCACC AGCGAGTCGT CGGGGTGAGC TGGCTGTGTG CGATGCAGTC 300 

AGTGGCTGGG TGACAGACCG CCGGACCGCT GTGGACTTGC GTGGGCGCGA 350 

i 1 

GGTGGAGGTG TTGGGCGAGG TGCCTGCAGC TGGCGGCAGT CCCCTCCGCC 400 
i 

AGTACTTCTT TGAAACCCGC TGCAAGGCTG ATAACGCTGA GGAAGGTGGC 450 
CCGGGGGCAG GTGGAGGGGG CTGCCGGGGA GTGGACAGGA GGCACTGGGT 500 
' 20 ATCTGAGTGC AAGGCCAAGC AGTCCTATGT GCGGGCATTG ACCGCTGATG 550 

CCCAGGGCCG TGTGGGCTGG CGATGGATTC GAATTGACAC TGCCTGCGTC 600 
TGCACACTCC TCAGCCGGAC TGGCCGGGCC TGAG 634 



10 



'15 



25 



30 (2) INFORMATION FOR SEQ ID NO: 2 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Pro Leu Pro Ser Cys Ser Leu Pro He Leu Leu Leu Phe 
40 1 5 10 15 

Leu Leu Pro Ser Val Pro He Glu Ser Gin Pro Pro Pro Ser Thr 
20 25 30 

45 Leu Pro Pro Phe Leu Ala Pro Glu Trp Asp Leu Leu Ser Pro Arg 

35 40 45 



50 



65 



Val Val Leu Ser Arg Gly Ala Pro Ala Gly Pro Pro Leu Leu Phe 

50 55 60 

Leu Leu Glu Ala Gly Ala Phe Arg Glu Ser Ala Gly Ala Pro Ala 

65 70 * 75 



Aen Arg Ser Arg Arg Gly Val Ser Glu Thr Ala Pro Ala Ser Arg 
55 80 85 90 

Arg Gly Glu Leu Ala Val Cys Asp Ala Val Ser Gly Trp Val Thr 
95 100 " 105 

60 Asp Arg Arg Thr Ala Val Asp Leu Arg Gly Arg Glu Val Glu Val 

110 115 120 



Leu Gly Glu Val Pro Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr 

125 130 135 

Phe Phe Glu Thr Arg Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly 

140 145 150 

Pro Gly Ala Gly Gly Gly Gly Cys Arg Gly Val Asp Arg Arg His 
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155 160 • 9 165 

Trp 'Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu 

170 ~ 175 ^ 180 

Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg Trp He Arg He 

185 190 195 



5 



Asp Thr Ala Cys Val Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
10 200) 205 210 

(2) INFORMATION FOR SEQ ID NO: 3: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 
, (B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: ( 

Met Thr He Leu Phe Leu Thr Met Val He Ser Tyr Phe Gly Cys 
1 5 i ' 10 15 

25 Met Lys Ala Ala Pro Met Lys Glu Ala Asn He Arg Gly Gin Gly 

20 25 30 



30 



45 



60 



Gly Leu Ala Tyr Pro Gly Val Arg Thr His Gly Thr Leu Glu Ser 

35 " 40 45 

Val Asn Gly Pro Lys Ala Gly Ser Arg Gly Leu Thr Ser Leu Ala 

50 55 60 



Asp Thr Phe Glu His Met He Glu Glu Leu Leu Asp Glu Asp Gin 

35 65 70 75 

Lys Val Arg Pro Asn Glu Glu Asn Asn Lys Asp Ala Asp Leu Tyr 

80 85 90 

40 Thr Ser Arg Val Met Leu Ser Ser Gin Val Pro Leu Glu Pro Pro 

95 100 105 



Leu Leu Phe Leu Leu Glu Glu Tyr Lys Asn Tyr Leu Asp Ala Ala 

110 115 120 

Asn Met Ser Met Arg Val Arg Arg His Ser Asp Pro Ala Arg Arg 

125 130 135 



Gly Glu Leu Ser Val Cys Asp Ser He Ser Glu Trp Val Thr Ala 

50 140 145 150 

Ala Asp Lys Lys Thr Ala Val Asp Met Ser Gly Gly Thr Val Thr 

155 160 165 

55 Val Leu Glu Lys Val Pro Val Ser Lys Gly Gin Leu Lys Gin Tyr 

170 175 180 



Phe Tyr Glu Thr Lys Cys Asn Pro Met Gly Tyr Thr Lys Glu Gly 

185 190 " 195 

Cys Arg Gly He Asp Lys Arg His Trp Asn Ser Gin Cys Arg Thr 

200 205 210 



Thr Gin Ser Tyr Val Arg Ala Leu Thr Met Asp Ser Lys Lys Arg 

65 215 220 " 225 

He Gly Trp Arg Phe He Arg He Asp Thr Ser Cys Val Cys Thr 

230 235 ~ 240 
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Leu Thr lie Lys Arg Gly Arg 1 
245 247 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 257 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ser lie Leu Phe iTyr Val lie Phe Leu Ala Tyr Leu Arg Gly 
! 5 10 15 

lie Gin Gly Asn Asn Met Asp Gin Arg Ser Leu Pro Glu Asp Ser 
20 25 30 



Leu Asn Ser Leu lie He Lys Leu He Gin Ala Asp He Leu Lys 

'20 35 40 45 

Asn Lys Leu Ser Lys Gin Met Val Asp Val Lys Glu Asn Tyr Gin 

50 55 60 

25 Ser Thr Leu Pro Lys Ala Glu Ala Pro Arg Glu Pro Glu Arg Gly 

65 70 75 



Glv Pro Ala Lys Ser Ala Phe Gin Pro Val He Ala Met Asp Thr 

80 85 90 

Glu Leu Leu Arg Gin Gin Arg Arg Tyr Asn Ser Pro Arg Val Leu 

95 ~ 100 105 



Leu Ser Asp Ser Thr Pro Leu Glu Pro Pro Pro Leu Tyr Leu Met 

35 HO 115 120 

Glu Asp Tyr Val Gly Ser Pro Val Val Ala Asn Arg Thr Ser Arg 

125 130 135 

40 Arg Lys Arg Tyr Ala Glu His Lys Ser His Arg Gly Glu Tyr Ser 

140 145 150 



Val Cys Asp Ser Glu Ser Leu Trp Val Thr Asp Lys Ser Ser Ala 

155 160 165 

He Asp He Arg Gly His Gin Val Thr Val Leu Gly Glu He Lys 

170 175 180 



Thr Gly Asn Ser Pro Val Lys Gin Tyr Phe Tyr Glu Thr Arg Cys 

50 - 185 190 195 

Lys Glu Ala Arg Pro Val Lys Asn Gly Cys Arg Gly He Asp Asp 

200 205 210 

55 Lys His Trp Asn Ser Gin cys Lys Thr Ser Gin Thr Tyr Val Arg 

215 220 225 



Ala Leu Thr Ser Glu Asn Asn Lys Leu Val Gly Trp Arg Trp He 

230 235 240 

Arq He Asp Thr Ser cys Val Cys Ala Leu Ser Arg Lys He Gly 

245 " 250 255 



Arg Thr 
65 257 



(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
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{ A) LENGTH : 241 amino acids . v 

<B) TYPE: amino acid 
• (D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Met Leu Phe Tyr Thr Leu lie Thr Ala Phe Leu lie Gly 
1 5 10 " IS 

10 lie Gin Ala Glu Proi His Ser Glu Ser Asn Val Pro Ala Gly His 

20 25 30 



15 



30 



45 



Thr He Pro Gin Val His Trp Thr Lys Leu Gin His Ser Leu Asp 

1 35 , 40 45 

Thr Ala Leu Arg Arg Ala Arg Ser Ala Pro Ala Ala Ala He Ala 

50 55 60 



Ala Arg Val Ala Gly Gin Thr Arg Asn lie Thr Val Asp Pro Arg 

20 65 70 ' 75 

Leu Phe Lys Lys Arg Arg Leu Arg Ser Pro Arg Val Leu Phe Ser 

80 i 85 90 

25 Thr Gin Pro Pro Arg Glu Ala Ala Asp Thr Gin Asp Leu Asp Phe 

95 100 105 



Glu Val Gly Gly Ala Ala Pro Phe Asn Arg Thr His Arg Ser Lys 

110 115 " 120 

Arg Ser Ser Ser His Pro He Phe His Arg Gly Glu Phe Ser Val 

125 130 135 



Cys Asp Ser Val Ser Val Trp Val Gly Asp Lys Thr Thr Ala Thr 

35 140 145 150 

Asp He Lys Gly Lys Glu Val Met Val Leu Gly Glu Val Asn lie 

155 160 165 

40 Asn Asn Ser Val Phe Lys Gin Tyr Phe Phe Glu Thr Lys Cys Arg 

170 175 ~ 180 



Asp Pro Asn Pro Val Asp Ser Gly Cys Arg Gly lie Asp Ser Lys 
185 190 195 

His Trp Asn Ser Tyr Cys Thr Thr Thr His Thr Phe Val Lys Ala 
200 205 210 



Leu Thr Met Asp Gly Lys Gin Ala Ala Trp Arg Phe He Arg He 
50 215 220 225 

Asp Thr Ala Cys Val Cys Val Leu Ser Arg Lys Ala Val Arg Arg 
230 235 240 

55 Ala 
241 

(2) INFORMATION FOR SEQ ID NO: 6: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Pro Arg Val Val Leu Ser Arg Gly Ala Pro Ala Gly Pro Pro 
1 5 10 15 
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Leu Leu Phe Leu Leu* Glu Ala Gly Ala Phe Arg Glu Ser Ala Gly' 
20 '25 30 

Ala Pro Ala Asn Arg Ser Arg Arg Gly Val Ser Glu Thr Ala Pro 
5 35 40 45 

Ala Ser Arg Arg Gly Glu Leu Ala Val Cys Asp Ala Val Ser Gly 
50 ,55 60 

10 Trp Val Thr Asp Arg 1 Arg Thr Ala Val Asp Leu Arg Gly Arg Glu 

65 70 75 



15 



30 



35 



50 



55 



65 



Val Glu Val Leu Gly Glu Val Pro Ala Ala Gly Gly Ser Pro Leu 

80 ' 85 90 

Arg Gin Tyr Phe Phe Glu Thr Arg Cys Lys Ala Asp Asn Ala Glu 

95 100 105 



Glu Gly Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg Gly Val Asp 
20 110 115 120 

Arg Arg His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Val 
125 130 ( 135 

25 Arg Ala 1 Leu Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg Trp 

140 145 150 



lie Arg He Asp Thr Ala Cys Val Cys Thr Leu Leu Ser Arg Thr 
155 " 160 165 

Gly Arg Ala 
168 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 685 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
45 CGAGAGATGC TCTGAGAGAT GCTCCCACTC CCCCAGGCTC CCTCCGCATC 50 



CCCCTCATTT TCCTCCTCCC CAGTGTGTCA ATGGAGTCCT AACCCCATCC 100 



TCGACATTGT CGCCTTTTCC TCCTCCAGAG TGGGACCTTC TTTTCCCCCG 150 
AGTGGTCCTG TCTAGGGGTG CCGCTGCCGG GCCCCCTCTG GTCTTCCTGC 200 
TGGAGACTGG AGCCTTTCGG GAGTCAGCAG GCGCCCGGGC CAACCGCAGC 250 
60 CAGCGAGGGG TGAGCGATAC TTCACCGGCG AGTCATCAGG GTGAGCTGGC 300 

CGTGTGCGAT GCAGTCAGTG TCTGGGTGAC AGACCCCTGG ACTGCTGTGG 350 
ACTTGGGTGT GCTCGAGGTG GAGGTGTTGG GCGhGGTGCC TGCAGCTGTC 400 
GGCAGTTCCC TCCGCCAGCA CTTCTTTGTT GCCCGCTTCG AGGCCGATAA 450 
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50 



65 
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ATCTGAGGAA GGTGGCCCGG GGGTAGGTGG AGGGGCTGCC GCCGGGGTGT 5O0 
GGACCGGGGG GCACTGGGTG TCTGAGTGCA AGGCCAAGCA GTCCTATGTG 550 
CGGGCATTGA CCGCTGATGC CCAGGGCCGT GTGGACTGGC GATGGATTCA 600 
AATTGGCACA GCCTGTGTCT GCACACTCCT CAGCCGGACT GGCCGGGCCT 650 
GAGACTTATA CCCAG G AACT GGTCAGGCAG AAAAA 685 



(2) INFORMATION FOR SEQ ID NO: 8: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

25 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Glu Arg Cys Ser Glu Arg Cys Ser His Ser Pro Arg Leu Pro Pro 
1 5 10 15 

30 His Pro Pro His Phe Pro Pro Pro Gin Cys Val Asn Gly Val Leu 

20 25 ~ 30 



Thr Pro Ser Ser Thr Leu Ser Pro Phe Pro Pro Pro Glu Trp Asp 
35 40 45 

Leu Leu Phe Pro Arg Val Val Leu Ser Arg Gly Ala Ala Ala Gly 
50 55 60 



Pro Pro Leu Val Phe Leu Leu Glu Thr Gly Ala Phe Arg Glu Ser 
40 65 70 7 5 

Ala Gly Ala Arg Ala Asn Arg Ser Gin Arg Gly Val Ser Asp Thr 
80 85 90 

45 Ser Pro Ala Ser His Gin Gly Glu Leu Ala Val Cys Asp Ala Val 

95 100 " 105 



Ser Val Trp Val Thr Asp Pro Trp Thr Ala Val Asp Leu Gly Val 
110 115 120 

Leu Glu Val Glu Val Leu Gly Glu Val Pro Ala Ala Val Gly Ser 
125 130 135 



Ser Leu Arg Gin His Phe Phe Val Ala Arg Phe Glu Ala Asp Lys 
55 140 145 150 

Ser Glu Glu Gly Gly Pro Gly Val Gly Gly Gly Ala Ala Ala Gly 
155 160 165 

60 Val Trp Thr Gly Gly His Trp Val Ser Glu Cys Lys Ala Lys Gin 

170 175 180 



Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly Arg Val Asp 
185 190 " 195 

Trp Arg Trp lie Gin lie Gly Thr Ala Cys Val Cys Thr Leu Leu 
200 205 - 210 

Ser Arg Thr Gly Arg Ala 
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215 216 ' 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1190 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

i 

ACTGGAGCGC AGCACCACGC CCAGCTAATT TTGGTATTAT CAGTAGAGAT 50 
GTTGTTTCAC AGTGTTGGCC AGGCTGCTCT CAAACTCCTG ACCTCAAGTC 100 
AAACACCCGC CTCAGCCTCC CAAAGTGCTG GGACTACAGG TGTGAGCCAT 150 
AGTGCCTGAC CTGTAGTTGT TGAATATTTA TTATTAATCT ACAAGTTGGG 200 
TGTGATGCAA GTCCTTTATA TGGAGTCCCC CAAACTTCTA GAGCAAGGGC 250 
TTCCCCATAA TCCTGGCAGG CAGGCCTCCC CTGGGGTTCC CAACTTCTGA 300 
CCCCACTGAA GTGTTTATCT TCTTCCCTAA TCCCAGCCTC CTTTTCCCTG 350 
TCTCCATGTG CTCTGAGAGA TGCTCTGAGA GATGCTCCTG CTCCCCCAGG 400 
CTCCCTCCGC ATCCCCCTCA TTTTCCTCCT CCCCAGTGTG TCATTGGAGT 450 
CCTAACCCCA TCCTCGACAT TGTCGCGTTT TCCTCCTCCA GAGTGGGACC 500 
TTCTTTTCCC CCGAGTGGTC CTGTCTAGGG GTGCCGCTGC CGGGCCCCCT 550 
CTGGTCTTCC TGCTGGAGAC TGGAGCCTTT CGGGAGTCAG CAGGCGCCCG 600 
GGCCAACCGC AGCCAGCGTG GGGTGAGCGA TACTTCACCG GTGAGTCATC 650 
AGGGTGAGCT GGCCGTGTGC GATGCAGTCA CTGTCTGGGT GACAGACCCC 700 
TGGACTGCTG TGGACTTGGG TGTGCTCGAG GTGGAGGTGT TGGGTGAGGT 750 
GCCTGCAGCT GGCAGCAGTT CCCTCCGCCA GCACTTCTTT GTTACCCGCT 800 
TCGAGGCCGA TAAATCTAAG GAAGGTGGCC CGGGGGTAGG TGGAGGACCT 850 
GCCGCCGGGG TGTGGACCGG GGGGCACTGG GTGTCTGAGT GCAAGGCCAA 900 
GCAGTCCTAT GGGCGGG CAT TGACCACTGA TGCCCAGGGC CGTGTGGACT 950 
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. ggcgatggat tcAaattggc actgcctgtg tctgcacact cctcAgccgg 1000 

ACTGGCCGGG CCTGAGACTT ATACGCAGGA ACTGGTCAGG CAGAAAAAGA 1050 

5 

ACAGAGCTGG ATGCTGAGAG ACCTCAGGGT TGGCCCAGCT GCTCTACGGA 1100 

10 CGGACCCCAG TTGGGGAACT ' CATCAAATCA TCGCAAAATC TCAACTGTCT 1150 

. GAATTTGAGC TCAATCTCTG TAGGATGGGT GCAACAATGT 1190 



15 



25 



40 



55 



(2) > INFORMATION FOR SEQ ID NO:10: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 257 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

i • 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Ser Lys Gly Phe Pro He He Leu Ala Gly Arg Pro Pro Leu Gly 
1 5 10 15 



Phe Pro Thr Ser Asp Pro Thr Glu Val Phe He Phe Phe Pro Asn 

30 20 25 30 

Pro Ser Leu Leu Phe Pro Val Ser Met .Cys Ser Glu Arg cys Ser 

35 40 45 

35 Glu Arg Cys Ser Cys Ser Pro Arg Leu Pro Pro His Pro Pro His 

50 55 60 



Phe Pro Pro Pro Gin Cys Val He Gly Val Leu Thr Pro Ser Ser 
65 70 75 

Thr Leu Ser Arg Phe Pro Pro Pro Glu Trp Asp Leu Leu Phe Pro 
80 85 90 



Arg -Val Val Leu Ser Arg Gly Ala Ala Ala Gly Pro Pro Leu Val 
45 95 100 105 

Phe Leu Leu Glu Thr Gly Ala Phe. Arg Glu Ser Ala Gly Ala Arg 
110 115 120 

50 Ala Asn Arg Ser Gin Arg Gly Val Ser Asp Thr Ser Pro Val Ser 

125 130 135 



His Gin Gly Glu Leu Ala Val Cys Asp Ala Val Thr Val Trp Val 
140 145 * 150 

Thr Asp Pro Trp Thr Ala Val Asp Leu Gly Val Leu Glu Val Glu 
155 160 165 



Val Leu Gly Glu Val Pro Ala Ala Gly Ser Ser Ser Leu Arg Gin 

60 170 175 180 

His Phe Phe Val Thr Arg Phe Glu Ala Asp Lys Ser Lys Glu Gly 

185 190 * 195 

65 Gly Pro Gly Val Gly Gly Gly Pro Ala Ala Gly Val Trp Thr Gly 

200 205 210 

Gly His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Gly Arg 

215 220 225 
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Ala Leu Thr Thr Asp' Ala Gin Gly Arg Val Asp Trp Arg Trf> He 
230 ' ' 235 240 

Gin lie Gly Thr Ala Cys Val Cys Thr Leu Leu Ser Arg Thr Gly 
245 250 255 

Arg Ala 
257 

(2) INFORMATION FOR £eQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 971 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

i 

TTTGAACTCC TGACCTCAAG TCAAACACCG CCTCAGCCTC CCAAAGTGCT 50 

i ' 

GGGACCACAG GTGTGAGCCA TAGTGCCTGA CCTGTAGTTG TTGAATATTT 100 
ATTATTAATC TACAAGTTGG' GTGTGATGCA AGTCCCATAC ATGGAGTCCC 150 
CCAAACTTCT AGAGCAAGGG CTTCCCCATA ATCCTGGCAG GCAGGCCTCC 200 
CCTGGGGTTC CCAACTTCTG ACdTCACTGA AGTGTTTATC CTCTTCTCTA 2 50 
ATCCCAGCCT CCTTTTCCCT GTTTCCATGT CCTCTGAGAG ATGCTCCCGC 300 
TCCCCCAGGC TCCCTCTGCA TCCCCCTCAT TTGCTTCCTC CCCAGTGTGT 350 
CAATGGAGTC CTAACCCCCA CCCTTGACAT TGTCCCCTTT TCCTCCTCCA 400 
GAGTGGGACC TTATTTTCCC CCAAGTGGTC CTGTCTAGGG GTGCCGCTGC 450 
CGGGCCCCCT CTGGTCTTCC TGCTGGAGAC TGGGGCCTTT TGGGAGTCAG 500 
CAGGCACCCG GGCCAACCGC AGCCAGCGAG GGGTGAGCGA TACTTCACCG 550 
GCGAGTCATC AGGGTGAGCT GGCCGTGTGC GATGCAGTCA GTGTCTGGGT 600 
GACAGACCCC CGGACCGCTG TGGACTTGGT TGTGCTCGAG GTGGAGGTGT 650 
TGGGTGAGGT GCCTGCAGCT GGCAGCAGTT CCCTCCACCA ACACTTCTTT 700 
GTCACCTGCT TCAAGGCCGA TAACTCTGAA GAAGGTGGCC CAGGGGTAGG 750 
TGGAGGGGCT GCCGCTGGGG TGTGGACCGG GGGGCACTGG GTGTCTGAGT 800 
GCAAGGCCAA GCAGTCCTAT GTGCGGGCAT TGACCGCTGA TGCCCAGGGC 850 
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CGTGTGGACT GGCGATGGAT TCAAACTGGC ACAGCCTGTG TCTGCACACT 900 1 
5 CCTCAGCCGG ACTGGCCGGG CCTGAGACTT ATACCCAGGA ACTGGTCAGG 950 

CAGAAAAAGA ACAGAGCTAG G 971 



10 



20 



35 



50 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 186 amino acids 

' (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Pro Pro Pro Leu Thr Leu Ser Pro Phe Pro Pro Pro Glu Trp Asp 
15 10 15 



Leu He Phe Pro Gin Val Val Leu Ser Arg Gly Ala Ala Ala Gly 
25 20 25 30 

Pro Pro Leu Val Phe Leu Leu Glu Thr Gly Ala, Phe Trp Glu Ser 
35 40 ~ 45 

30 Ala Gly Thr Arg Ala Asn Arg Ser Gin Arg Gly Val Ser Asp Thr 

50 55 60 



Ser Pro Ala Ser His Gin Gly Glu Leu Ala Val Cys Asp Ala Val 

65 70 75 

Ser Val Trp Val Thr Asp Pro Arg Thr Ala Val Asp Leu Val Val 

80 85 90 



Leu Glu Val Glu Val Leu Gly Glu Val Pro Ala Ala Gly Ser Ser 

40 95 100 105 

Ser Leu His Gin His Phe Phe Val Thr Cys Phe Lys Ala Asp Asn 

110 115 120 

45 Ser Glu Glu Gly Gly Pro Gly Val Gly Gly Gly Ala Ala Ala Gly 

125 130 135 



Val Trp Thr Gly Gly His Trp Val Ser Glu Cys Lys Ala Lys Gin 

140 145 150 

Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly Arg Val Asp 

155 160 165 



Trp Arg Trp He Gin Thr Gly Thr Ala Cys Val Cys Thr Leu Leu 
55 170 175 180 

Ser Arg Thr Gly Arg Ala 
185 186 

60 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
65 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
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1 5 1 10 ' 15' 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 

5 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 , 45 

Ala Ala Gly Gly Ser Pro Leu > Arg Gin Tyr Phe Phe Glu Thr Arg 

10 50. 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

i 65 70 75 

15 Gly Gly Lys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

80 85 90 



20 



35 



50 



Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 ( 105 

Gly Arg Val Gly Trp Arg Trp Ile'Arg lie Asp Thr Ala Cys Val 
110 , . * 115 120 



Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
25 ' . 125 130 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 



Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
40 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35 40 45 

45 Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 



Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 

Gly Gly His Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

80 85 90 



Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
55 " 95 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg lie Asp Thr Ala Cys Val 
110 115 120 

60 Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 

125 130 

(2) INFORMATION FOR SEQ ID NO: 15: 

65 <i) SEQUENCE CHARACTERISTICS: 

(A) LEITGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTlbN: SEQ ID NO: 15: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

1 5 X0 15 

5 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu t Gly Glu Val Pro 

10 35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 

,15 Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 1 70 75 



20 



60 



65 



Gly Gly Gin Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

95 100 ~ 105 



Gly Arg Val Gly Trp Arg Trp lie Arg lie Asp Thr Ala Cys Val 
25 HO 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

30 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
40, 15 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 *" 30 

45 Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 ' 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 50 " 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 

Gly Gly Arg Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 
55 80 85 90 



Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 ~ 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
110 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
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(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

5 

Glv Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 5 , 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp ,Arg Arg Thr Ala 
10 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35, 40 45 

15 Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 ' 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
J cc m 1F> 



' 20 



65 70 " 75 

Gly Gly Cys Arg Gly Val Asp Arg Arg Glu Trp Val Ser Glu Cys 

80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

25 95 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg He As£ Thr Ala Cys Val 

110 115 120 

30 Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 

125 130 

(2) INFORMATION FOR SEQ ID NO: 18: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

15 10 15 

45 Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 2 5 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 

50 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 5 5 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

55 65 70 75 

Gly Gly Cys Arg G. / Val Asp Arg Arg Phe Trp Val Ser Glu Cys 

SO 85 90 

60 Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

95 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg lie Asp Thr Ala Cys Val 

110 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 

125 130 



65 



(2) INFORMATION FOR SEQ ID NO: 19: 
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10 



25 



55 
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(i) SEQUENCE CHARACTERISTICS: • r 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

1 5 |0 15 

i 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 



Val Asp Leu Arg Gly Ajrg Glu Val Glu Val Leu Gly Glu Val Pro 

15 . ' 35 40 * 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 

20 Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly 'Ala Gly Gly 

65 70 " 75 



Gly Gly Cys Arg Gly Val Asp Arg Arg Pro Trp Val Ser Glu Cys 
80 85 ' i 90 

Lys Ala' Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 105 



Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
30 110 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

35 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 130 amino acids 
(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
45 1 5 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 30 

50 Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 ' 45 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 * 60 

CyB Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 



Gly Gly Cys Arg Gly Val Asp Arg Arg Tyr Trp Val Ser Glu Cys 

60 80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

95 100 105 

65 Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 

HO 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 
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(2) INFORMATION FOR ^EQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
5 ( B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

10 Gly Val Ser Glu Thr'Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala ! 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 ' 25 30 

15 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

^35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
20 50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
* 65 70 75 

j 

25 Gly Gly' Cys Arg Gly Val Asp Arg Arg Trp Trp Val Ser Glu Cys 
* 80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
110 H5 . I 20 



30 



45 



60 



Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
35 125 130 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 5 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
50 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35 40 45 

55 Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

en *k*s 60 



50 " 55 60 

Cys Lys Ala Asp Asn Ala Ser Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

65 95 100 105 

Gly Arg val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 

110 H5 120 
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Cys Thr Leu Leu Ser Arg Thr Gly Ar^ 'Ala 1 
125 130 

(2) INFORMATION FOR SEQ ID NO: 23: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid ' 
(D) TOPOLOGY: linear 

i 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



10 



15 



Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 5 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 2 5 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
20 35 40 45 



25 



30 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 60 

Cys Lys Ala Asp Asn Ala Thr Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 

> 

Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 
80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
35 HO H5 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

40 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
<B) TYPE: amino acid 

45 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Arg Arg His Trp Val Ser Glu Cys Lys Ala LyB Gin 
50 1 5 10 12 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
65 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35 40 " 45 
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Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr. Phe' Phe Glu Thr Arg 
50 55 60 



15 



30 



35 



40 



50 



55 



60 



Cys 
61 



(2) INFORMATION FOR SEQ ID Np:26: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 



Val Cys 
20 17 

(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Cve Asp Ala Val ser Gly Trp Val Thr Asp Arg Arg Thr Ala Val 

1 5 10 15 

Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro Ala 

20 25 30 

Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg Cys 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 62 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala Val 

Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro Ala 

20 25 30 

Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg Cys 

35 40 45 

Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly 

50 55 60 



Gly Cys 
62 



65 (2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY a linear . ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

5 Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala Val 

1 5 10 15 



10 



35 



40 



Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro Ala 
20 . 25 30 

i 

Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg Cys 
35 40 45 



Lys 'Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly 

15 50 ' 55 ~ 60 

G\y Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

65 70 74 

20 (2) INFORMATION FOR SEQ ID NO: 30: i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103' amino acids 

(B) TYPE: amino acid i 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION:, SEQ ID NO: 30: 

Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala Val 

30 1 5 10 15 

Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro Ala 

20 ' 25 30 

Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg Cys 

35 40 45 

Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly 

50 55 " 60 

Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys 

65 70 75 



Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly 
45 80 85 90 

Arg Val Gly Trp Arg Trp lie Arg He Asp Thr Ala Cys 
95 100 103 

50 (2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala Val 
60 1 5 10 15 

Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro Ala 
20 25 30 

65 Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg Cys 

35 40 45 

Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly 
50 55 "* 60 
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25 



40 



55 
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Gly Cys Arg Gly Val Asp Arg Arg His Trp Val' Ser Glu Cys Lye 
65 70 " 75 

Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly 
80 85 90 

Arg Val Gly Trp Arg Trp lie Arg lie Asp Thr Ala Cys Val Cys 
95 100 105 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 amino acids 
"l§ (B) TYPE: amino acid 

( D ) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

20 Arg Gly Glu Leu Ala Val Cys Asp Ala Val Ser Gly Trp Val Thr 
15 10 15 



Asp Arg 
17 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

35 Arg Gly Glu Leu Ala Val Cys Asp Ala Val Ser Gly Trp Val Thr 
1 5 10 15 



Asp Arg Arg Thr Ala Val Asp Leu Arg 
20 24 

(2) INFORMATION FOR SEQ ID NO: 34 1 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

50 Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro Ala Ala Gly 

15 10 15 



Gly Ser Pro Leu Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 amino acids 
60 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

65 Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

15 10 15 

Gly Gly Cys 
18 
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(2) INFORMATION FOR 1 SEQ ID NO: 36: ' ' 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

10 Arg Gin Tyr Phe Phe Glu Thr Arg Cys 

15 9 

(2) INFORMATION FOR SEQ ID NO:37: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 1 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

1 , 5 1 10 15 

i 

i 

25 Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

20 25 30 



30 



35 



40 



45 



50 



55 



60 



Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
35 40 45 

Gly Arg Val Gly Trp Arg Trp lie Arg lie Asp Thr Ala Cys 
50 .55 59 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
1 5 10 15 

Gly Gly Cys 
18 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys Ala 
1 5 10 15 

Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly Arq 
20 25 30 



Val Gly Trp Arg Trp lie Arg lie Asp Thr Ala Cys 
65 35 40 42 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3b amino acids 1 1 

, (B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly P,ro Gly Ala Gly Gly 
1 5 10 15 

10 Gly Gly Cys Arg Gly 1 Val Asp Arg Arg His Trp Val Ser Glu Cys 

20 25 30 



15 



20 



25 



30 



35 



40 



65 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear ( 

(xi) SEQUENCE DESCRIPTION: SEQ'lD NO:41: 

Arg Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly 
1 5 10 '15 

Gly Gly Gly Cys 
19 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly 
1 5 10 15 

Gly Cys Arg Gly Val Asp Arg Arg His Trp Val ser Glu Cys Lys 
20 25 30 



Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly 
45 35 40 45 

Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys 
50 55 58 

50 (2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly 
60 1 5 10 15 

Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

5 

Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys Ala Lys 
1 5 10 15 

Gin Ser Tyr Val Arg 
10 20 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



25 



30 



35 



50 



65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Arg Arg His Trp Val Ser Glu Cys Lys Ala Lys 
1 5 10 11 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg 
1 5 10 11 

(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

45 Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

1 5 10 15 



Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 - 45 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Tnr Arg 
55 50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 " 75 

60 Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

80 85 90 



Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

95 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg lie Asp Thr Ala Cys Val 

110 115 120 

Cys Val Leu Thr Val Lys Arg Val Arg Arg 
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125 '130 
(2) INFORMATION FOR SEQ ID NO: 48: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID* NO:48: 

Val Leu Gly Glu Val Pro Ala Ala Gly Gly Ser Pro Leu Arg Girt 
15 10 15 

' 15 Tyr Phe Phe Glu Thr Arg Cys Lys Ala Asp Asn Ala Glu Glu Gly 

20 25 30 



20 



Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg Gly Val Asp Arg Arg 
35 40 45 

His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Val Arg Ala 

55 60 



50 



Leu Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg Trp lie Arg 

25 65 70 75 

He Asp Thr Ala Cys Val Cys Val Leu Thr Veil Lys Arg Val Arg 

80 85 90 

30 Arg 
91 

(2) INFORMATION FOR SEQ ID NO: 49: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Val Leu Gly Glu Val Pro Ala" Ala Gly Gly Ser Pro Leu Arg Gin 

15 10 15 

45 Tyr Phe Phe Glu Thr Arg Cys Lys Ala Asp Asn Ala Glu Glu Gly 

20 25 30 

Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg ,Gly Val Asp Arg Arg 

35 40 45 

50 

His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Val Arg Ala 

50 55 60 

Leu Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg Trp He Arg 

55 65 70 75 

He Asp Thr Ala Cys Val Cys Ser Leu Thr He Lys Arg He Arg 

' 80 85 90 



60 Ala 
91 



(2) INFORMATION FOR SEQ ID NO: 50: 

65 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESQRIPTION: SEQ ID NO:- 50: , r 

Val .Leu Gly Glu Val Pro Ala Ala Gly Gly Ser Pro Leu Arg Gin 

1 5 10 15 

5 

Tyr Phe Phe Glu Thr Arg Cys Lys Ala Asp Asn Ala Glu Glu Gly 

20 25 30 

Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg Gly Val Asp Arg Arg 
10 35, ' 40 45 

His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Val Arg Ala 

50 55 60 

i 

15 Leu Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg Trp He Arg 

65 70 75 



20 



25 



30 



35 



50 



He Asp Thr Ala Cys Val Cys Thr Leu Ser Arg Lys Ala Gly Arg 
80 85 90 

i 

Arg Ala 
92 

(2) INFORMATION FOR SEQ ID NO: 51: 

. • - 1 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 132 amino acids 
(B) TYPE: amino acid ' 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Asp Asp Asp Ser Pro He Ala Arg Arg Gly Glu He Ser Val Cys 
1 5 10 15 

Asp Ser Val Ser Asp Trp Val Ser Ala Pro Asp Lys Asp Thr Ala 
20 25 30 



Val Asp He Lys Gly Asp Asp Val Met Val Leu Lys Lys Val Gly 
40 35 40 . 45 

He Asn His Ser Val Val Leu Gly Glu Val Pro Ala Ala Gly Gly 
50 55 60 

45 Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg Cys Lys Ala Asp 

65 70 75 



Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg 

80 85 "* 90 

Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys Ala Lys Gin 

95 100 105 



Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly Arg Val Gly 
55 110 115 ~ 120 

Trp Arg Trp He Arg He Asp Thr Ala Cys Val Cys 
125 130 132 

60 (2) INFORMATION FOR SEQ ID NO: 52: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 
65 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ser Ser Ser His Pro He Phe His Arg Gly Glu Phe Ser Val Cys 
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! 4 10 • ' 15 

Asp'ser val Ser Val Trp Val Gly Asp Lye Thr Thr Ala Thr Asp 
20 25 30 

lie Lvs Gly Lys Glu Val Met Val Leu Gly Glu Val Asn He Asn 
35 40 , 45 

Asn Ser Val Val Leu Gly Glu Val Pro A=la Ala Gly Gly Ser Pro 
10 50' 55 60 : 

Leu Arc Gin Tyr Phe Phe Glu Thr Arg Cys Lys Ala Asp Asn Ala 
,65 70 75 

15 Glu Glu GIV Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg Gly Val 

80 85 90 

Asp Arg Arg His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr 
95 100 ! 105 

20 

Val Arg Ala Leu Thr Ala Asp Ala' Gin Gly Arg Val Gly Trp Arg 
110 115 120 

i ■ 

Trp He Arg He Asp Thr Ala Cys Val Cys Val C^s Val Leu .Ser 
2 5 * 125 130 135 

Arg Lys Ala Val Arg Arg Aid 
140 142 

30 (2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
35 (D ) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Glv Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
40 1 5 10 IB 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 30 

45 Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 



50 



Ala Ala Gly Gly Ser Pro Leu Arg Glu Tyr Phe Phe Glu Thr Arg 
50 55 60 

Cvs Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 

Gly Gly Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys 
55 80 85 90 

Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly 
95 100 105 

60 Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val Cys 

110 H5 120 



65 



Thr Leu Leu Ser Arg Thr Gly Arg Ala 
12 5 129 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 
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. 20 



50 



60 



65 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

5 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 5 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
10 20 ' 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
3£ 40 45 

15 Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 ■ 55 60 



Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly 

65 70 75 

Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys 

80 85 90 



Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly 
25 95 100 * 105 

Arg Val Gly Trp Arg Trp lie Arg He Asp Thr Ala Cys Val Cys 
110 115 120 

30 Thr Leu Leu Ser Arg Thr Gly Arg Ala 

125 129 

(2) INFORMATION FOR SEQ ID NO: 55: 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 

45 Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 



Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Arg Cys Lys Ala Asp Asn Ala 

50 55 60 



Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg Gly Val 
55 65 70 75 



Asp Arg Arg His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr 
80 85 90 

Val Arg Ala Leu Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg 
95 100 105 

Trp He Arg He Asp Thr Ala Cys Val Cys Thr Leu Leu Ser Arg 
HO H5 120 

Thr Gly Arg Ala 
124 

(2) INFORMATION FOR SEQ ID NO: 56: 
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(i) SEQUENCE CHAl^ACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Glv Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

1 5 -10 15 

10 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

' 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
•J5 35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 ~ 55 60 

Arq His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Val Arg 
65 70 75 

Ala Leu Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg Trp lie 
80 85 1 90 

Arq He Asp Thr Ala Cys Val Cys Thr Leu Leu Ser Arg Thr Gly 
95 100 105 



20 



25 



Arg Ala 
30 107 

<2) INFORMATION FOR SEQ ID N0:57: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:57: 

Glv Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 

Val Cye Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
45 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 "75 

Glv Gly Cys Arg Gly Val Asp Arg Arg Glu Cys Lys Ala Lys Gin 
80 85 90 

Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly Arg Val Gly 
60 95 100 105 

Trp Arq Trp He Arg He Asp Thr Ala Cys Val Cys Thr Leu Leu 
^ 110 H5 120 

65 Ser Arg Thr Gly Arg Ala 

125 126 

(2) I N FORMAT I ON FOR SEQ ID NO: 58: 



50 



55 
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(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly, Glu Leu Ala 
1 5 10 * 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 30 

Val Asp Leu Arg Gly' Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35 40 " 45 



10 



20 



25 



40 



50 



55 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg Arg His Ala Asp Ala Gin Gly 
80 85 90 

Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val Cys 
95 100 , 105 



Thr Leu Leu Ser Arg Thr Gly Arg Ala 
30 no H4 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

1 5 10 is 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

45 20 25 " 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 "* 45 

Ala Ala Gly Gly Ser Pro Leu His Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 • 75 

• Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

80 85 go 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

60 9S 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg He Asp Thr Ala Cys Val 

11° H5 120 

65 Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 

125 130 

(2) INFORMATION FOR SEQ ID NO: 60: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Glv Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 5 ( 10 15 

Val Cys Asp Ala Val ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 30' 

i 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
15, 35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 60 

' 20 Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 



10 



25 



Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 
80 85 90 

His Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 ' 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
30 HO H5 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

35 (2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
40 (D ) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
45 1 5 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 30 

50 Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 1 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 



55 



Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 
60 80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 ~ 100 105 

65 Gly Arg Phe Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 

110 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 
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(2) INFORMATION FOR SEQ ID NO: 62: ■ 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 antino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:' 

10 Gly Val Ser Glu ThnAla Pro' Ala Ser Arg Arg Gly Glu Leu Ala 

1 5 10 15 



15 



25 



30 



45 



60 



Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 , 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
20 50 55 1 60 



Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 i * 70 75 

Gly Gly, Cys Arg Gly Val Asp Arg Gin His Trp Val Ser Glu Cys 
80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
110 115 120 



Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
35 125 130 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 



Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arq Thr Ala 

50 20 ~ 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 - 45 

55 Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 



Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg His His Trp Val Ser Glu Cys 

80 85 90 



Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
65 95 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg He Asp Thr Ala Cys Val 
110 115 120 
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Cye Thr Leu Leu Ser' Arg Thr Gly Arg Al& 
125 " 130 

(2) INFORMATION FOR SEQ ID NO: 64: 

5 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY : ( linear ■ ' . ' • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Glv Val ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1,5- 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
& 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
20 35 40 ' 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 60 

t 

25 Cys LyS Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg Asn His Trp Val Ser Glu Cys 

80 85 90 

30 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

95 100 105 

Glv Arq Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
35 HO H5 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

40 (2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
45 (D ) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
50 1 5 10 15 

Val Cys Asp Ala Val ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 30 

55 Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 



60 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg Thr His Trp Val Ser Glu Cys 
65 ^ 80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 1O0 105 
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Gly Arg Val Gly Trp Arg Ttp lie Arg lie Asp -Thr Ala Cys Val 
110 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
5 125 130 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
10 <A) LENGTH: 130 amino acids ' 1 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 



30 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 : 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 



Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
.20 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35 40 . 45 

25 Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 



Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg Tyr His Trp Val Ser Glu Cys 

80 85 90 



Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
35 95 100 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
110 115 120 

40 Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 

125 130 

(2) INFORMATION FOR SEQ ID NO: 67: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 <xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
15 10 15 

55 Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

60 35 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 65 70 - 75 

Gly Gly Cys Arg Gly Val Asp Arg Trp His Trp Val Ser Glu Cys 
80 85 90 
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Lys Ala Lys Gin Ser Tyr Val Arg Ala lieu Thr Ala. Asp Ala Gin 
95 100 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Asp Thr Ala Cys Val 
5 110 ~ 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

10 (2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
15 ( (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
20 15 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr- Ala 
20 25 30 

25 Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 



30 



45 



60 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

65 70 75 



Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

35 " 80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

95 100 105 

40 Gly Arg Val Gly Trp Arg Trp He Arg He Glu Thr Ala Cys Val 

110 115 120 



Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

55 Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

15 10 15 



Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 



Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
65 50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 
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Gly Gly Cys Arg Gly < Val Asp Arg Arg His Trp Val Ser Glxi Cys 
80 .85 90 

Lye Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
5 95 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg lie Asn Thr Ala Cys Val 
110 115 ' 120 

10 Cys Thr Leu Leu Ser i Arg Thr 'Gly Arg Ala 

125 130 

(2) INFORMATION FOR SEQ ID NO: 70: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 1 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 5 i ■ 10 15 

25 Val Cys ( Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 



30 



45 



50 



55 



60 



Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 



Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 

35 65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 

80 85 90 

40 Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr, Ala Asp Ala Gin 

95 100 105 



Gly Arg Val Gly Trp Arg Trp He Arg He Gin Thr Ala Cys Val 
HO 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

1 5 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

65 35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 

50 55 60 
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Cys Lys Ala Asp Asn Ala Glu Glu Gly 6ly Pro Gly Ala Gly Gly 
, 65 70 75 

Glv Gly Cys Arg Gly Val Asp Arg Arg His Trp Val Ser Glu Cys 
80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 105 

Gly Arg Val Gly Trp Arg Trp lie Arg lie Tyr Thr Ala Cys Val 
J 110 115 120 

Cys Thr Leu Leu sen Arg Thr Gly Arg Ala 
125 130 

(2) INFORMATION FOR SEQ ID NO: 72: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 5 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 
20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 
35 40 45 

Ala Ala Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
35 50 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
65 70 75 

Gly Gly Cys Arg Gly Val Asp Arg Arg His Trp Val ser Glu Cys 
80 85 90 

Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 
95 100 105 

Gly Arg Val Gly Trp Arg Trp He Arg He Ser Thr Ala Cys Val 
110 H5 120 

CyB Thr Leu Leu Ser Arg Thr Gly Arg Ala 
50 125 130 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 

15 10 15 

Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala 

65 20 25 30 

Val Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro 

35 40 45 
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Ala Ala QYy Gly Ser 1 Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg 
50 ■ • 55 60 

Cys Lys Ala Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly 
5 65 70 75 

Gly Gly CyB Arg Gly Val Asp Arg Arg His Txp Val Ser Glu Cys 
80 85 9b 

10 Lys Ala Lys Gin Ser' Tyr Val' Arg Ala Leu Thr Ala Asp Ala Gin 

95 100 105 



15 



20 



25 



30 



Gly Arg Val Gly Trp Arg Trp lie Arg lie Thr Thr Ala Cys Val 
110 , 115 120 

Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala 
125 130 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 1 ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

lie Lys Thr Gly 
1 4 

(2) INFORMATION FOR. SEQ ID NO:75: 



(i) SEQUENCE CHARACTERISTICS :■ 
(A) LENGTH: 5 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

40 Glu lie Lys Thr Gly 

1 5 

(2) INFORMATION FOR SEQ ID NO: 76: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Glu lie Lys Thr Gly Asn 
1 5 6 

55 (2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
<B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Ser Pro Val Lys 
65 1 4 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 4* amiro acids 
, (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Lys Ser Ser Ala , 
1 4 

10 (2) INFORMATION FOR £eQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 
t (A) LENGTH: 6 amino acids 
{ B ) TYPE: amino' acid 
15 (D) TOPOLOGY: linear 

(ici) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

Tyr Ala Glu His Lys ser 
20 1 5 6 

(2) INFORMATION FOR S^Q ID NO: 80: 

<i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 7 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear ' 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 

Arg Tyr Ala Glu His Lys Ser 

1 5 7- 

(2) INFORMATION FOR SEQ ID NO: 81 t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Arg Tyr Ala Glu His Lys Ser His 
J 5 8 

(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

55 Tyr Ala Glu His Lys Ser His 

1 5 7 

(2) INFORMATION FOR SEQ ID NO: 83: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Ala Asn Arg Thr Ser 
1 5 
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(2) INFORMATION FOR SEQ ID' NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 

10 Ala Asn Arg Thr 1 

1 4 

(2) INFORMATION FOR SEQ ID NO: 85: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 

Asn Arg Thr Ser 
1 4 

25 (2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Lys Glu Ala Arg 
35 1 4 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



45 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Lys Glu Ala Arg Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

lie Asp Asp Lys 
1 * 4 

(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
65 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
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Ser Glu Asn Asn 

1 1 4 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(D ) TOPOLOGY: linear , ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 

Thr Ser Glu Asn Asn 1 
1 5 

(2) INFORMATION FOR SEQ ID NO: 91: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
1 20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

25 Thr Ser Glu Asn Asn Lys 

1 5 6 

(2) INFORMATION FOR SEQ ID NO: 92: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Lys Leu Val Gly 
1 4 

40 (2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Cys Cys Gly Cys Gly Cys Gly Cys Thr Cys Thr Ala Gly Ala Gly 
50 1 5 10 15 

Thr Cys Gly Ala Cys Ala Ala Gly Cys Ala Gly Thr Ala Cys Thr 
20 25 30 

55 Thr Cys Thr Ala Th. Gly Ala Gly Ala Cys Gly Ala Ala Gly Thr 

35 40 45 



60 



Gly Thr 
47 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 47 amino acids 
65 (B) TYPE: amino acid 

(D ) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
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Cys Cys Gly Cys Gly 1 Cys Gly Cys Thr Cys Thr Ala Gly Ala Gly 
1 , 5 "10 15 

Thr Cys Gly Ala Cys Ala Ala Ala Cys Ala Ala Thr Ala Thr Thr 
5 ' 20 25 30 

Thr Thr Thr Thr CyB Gly Ala Ala Ala Cys Cys Cys Gly Ala Thr 
35 40 4$ 

10 Gly Cys ' 

47 

(2) INFORMATION FOR SEQ ID NO: 95: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 
1 (B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: ' 

Cys Cys Gly Cys Gly Cys Gly Cys Thr Cys Thr Ala Gly Ala Gly 
1 5 1 ' 10 15 

25 Thr Cys Gly Ala Cys Ala Ala Gly Cys Ala Gly Thr Ala Cys Thr 

20 25 30 



30 



35 



40 



45 



Thr Cys Thr Ala Thr Gly Ala Gly Ala Cys Thr Ala Ala Gly Thr 
35 40 45 

Gly Thr 
47 

(2) INFORMATION FOR SEQ ID NO:96: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Cys Cys Gly Cys Gly Cys Gly Cys Thr Cys Thr Ala Gly Ala Gly 
1 5 10 15 

Thr Cys Gly Ala Cys Ala Ala Gly Cys Ala Gly Thr Ala Cys Thr 
20 25 30 



Thr Cys Thr Ala Thr Gly Ala Gly Ala Cys Ala Ala Ala Gly Thr 
50 35 40 45 

Gly Thr 
47 

55 (2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Cys Gly Gly Cys Thr Cys Ala Gly Gly Gly Cys Cys Gly Ala Ala 
65 1 5 10 15 

Thr Thr Cys Gly Cys Ala Cys Ala Cys Gly Cys Ala Gly Gly Ala 
20 25 30 
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Ala Gly Thr Ala Thr Cys Thr Ala Thr Cys Cys Thr Thr Ala Thr 
35 '40 45 

5 (2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 1 
10 (D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

i 

Cys Gly Gly Cys Thr Cys Ala Gly Gly Gly Cys Cys Gly Ala Ala 
15 1 5 10 15 

Thr Thr Cys Ala Cys Ala Thr Ala Cys Ala Cys Ala Ala Gly Cys 
20 25 30 

Glv Gly Thr Gly Thr Cys Ala Ala ,Thr Thr Cys Gly Gly Ala Thr 
35 40 45 



20 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Cvs Glv Gly Cys Thr Cys Ala Gly Gly Gly Cys Cys Gly Ala Ala 
15 10 15 

Thr Thr Cys Gly Cys Ala Gly Ala Cys Gly Cys Ala Gly Gly Ala 
20 25 30 

Thr Gly Thr Ala Thr Cys Gly Ala Thr Gly Cys Thr Ala Ala Thr 
40 35 40 45 

(2) INFORMATION FOR SEQ ID NO: 100: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Cys Gly Gly Cys Thr Cys Ala Gly Gly Gly Cys Cys Gly Ala Ala 
15 10 15 

55 Thr Thr CyB Gly Cys Ala Ala Ala Cys Gly Cys Ala Gly Gly Ala 

20 25 30 

Thr Gly Thr Ala Thr Cys Gly Ala Thr Gly Cys Thr Ala Ala Thr 
35 40 45 

60 
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Claims ' 1 

1 . An isolated nucleic acid encoding NT-4, 

2. An isolated nucleic acid of claim 1 comprising the nucleotide sequence shown 
in Figure 1 for mature human NT-4. 

5 3. A vector comprising the nucleic acid of claim 1. 

4. A host cell comprising the nucleic acid of claim 1 . 

5. A composition comprising NT-4 from an animal species, which composition is 
free of contaminating polypeptides of that animal species. 

< 6. A composition of claim 5 wherein the NT-4 is from human. • 

10 7. A nucleic acid that hybridizes to DNA encoding NT-4 under stringent conditions, 

excluding nucleic acid encoding NGF, BDNF, or NT-3. 

8. A nucleic acid of claim 7 comprising the nucleotide sequence shown in Figure 
3 for human NT-40. 

9. A nucleic acid of claim 7 comprising the nucleotide sequence shown in Figure 
15 4 for human NT-4p. 

10. A nucleic acid of claim 7 comprising the nucleotide sequence shown in Figure 
5 for human NT-4 A. 

11. A composition comprising NT-4 linked to an immunogenic polypeptide or a non- 
proteinaceous polymer. 

20 12. A pharmaceutical composition comprising an effective amount of NT-4 in a 

pharmaceutical^ acceptable carrier. 

13. A composition of claim 12 further comprising NGF, BDNF, or NT-3. 

14. An antibody that is capable of binding NT-4, but that is not capable of binding 
NGF, BDNF, or NT-3. 

25 15. A monoclonal antibody capable of binding NT-4. 

1 6. A monoclonal antibody of claim 1 5 which does not cross-react with NGF, BDNF, 
or NT-3. 

17. A method for treating a neurodegenerative disease or damaged nerve cells 
comprising administering to a mammal an effective amount of NT-4. 
30 1 8. A method of claim 1 7 wherein the mammal is human. 

19. A method of claim 17 wherein an effective amount of NGF, BDNF, or NT-3 is 
also administered to the mammal. 

20. A method of claim 1 7 wherein the neurodegenerative disease is Huntington's 
chorea, Alzheimer's disease, ALS, or Parkinson's disease and the damaged nerve cells are due 

35 to trauma. 

21 . A method for detecting NT-4 in vitro or in vivo comprising employing an antibody 
of claim 14. 

22. A method for purifying NT-4 comprising passing a mixture of NT-4 over a column 
to which is bound an antibody of claim 14. 
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23. A method for producing NT-4, comprising cuituring the host cell of claim 4 and 
recovering ISJT-4 from the host cell culture. 

24. The method of claim 23 wherein the NT-4 is recovered from the host cell culture 
medium. 
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ATG CTC CCT CTC CCC TCA 'TGC TCC CTC CCC ATC CTC CTC CTT TTC 
Met Leu Pro Leu Pro Ser Cys Ser Leu Pro He Leu Leu Leu Phe 

CTC CTC CCC ACT GTG CCA ATT GAG TCC CAA CCC CCA CCC TCA ACA TTG 
Leu Leu Pro Ser Val Pro lie Glu Ser Gin Pro Pro Pro Ser Thr Leu 

CCC CCT TTT CTG GCC CCT GAG TGG GAC fcTT CTC TCC CCC CGA GTA GTC 
Pro Pro Phe Leu Ala Pro Glu Trp Asp Leu Leu Ser Pro Arg Val Val 

* 

, CTG TCT AGG GGT GCC CCT GCT GGG CCC CCT CTG CTC TTC CTG CTG GAG • 
Leu Ser Arg Gly Ala Pro Ala Gly Pro Pro Leu Leu Phe Leu Leu Glu 

, GCT GGG GCC TTT CGG GAG TCA GCA GGT GCC CCG GCC AAC CGC AGC CGG 
Ala Gly Ala Phe Arg Glu Ser Ala Gly Ala Pro Ala Asn Arg Ser Arg 

I 

CGT GGG GTG AGC GAA ACT GCA CCA GCG AGT CGT CGG GGT GAG CTG GCT 

Arg Gly Val Ser Glu Thr Ala Pro Ala Ser Arg Arg Gly Glu Leu Ala 
1 10 , 

GTG TGC GAT GCA GTC AGT GGC TGG GTG ACA GAC CGC CGG ACC GCT GTG 
Val Cys Asp Ala Val Ser Gly Trp Val Thr Asp Arg Arg Thr Ala Val 
20 30 

GAC TTG CGT GGG CGC GAG GTG GAG GTG TTG GGC GAG GTG CCT GCA GCT 

Asp Leu Arg Gly Arg Glu Val Glu Val Leu Gly Glu Val Pro Ala Ala 

40 

GGC GGC AGT CCC CTC CGC CAG TAC TTC TTT GAA ACC CGC TGC AAG GCT 

Gly Gly Ser Pro Leu Arg Gin Tyr Phe Phe Glu Thr Arg Cys Lys Ala 
50 60 

GAT AAC GCT GAG GAA GGT GGC CCG GGG GCA GGT GGA GGG GGC TGC CGG 
Asp Asn Ala Glu Glu Gly Gly Pro Gly Ala Gly Gly Gly Gly Cys Arg 

70 

GGA GTG GAC AGG AGG CAC TGG GTA TCT GAG TGC AAG GCC AAG CAG TCC 
Gly Val Asp Arg Arg His Trp Val Ser Glu Cys Lys Ala Lys Gin Ser 
80 90 

TAT GTG CGG GCA TTG ACC GCT GAT GCC CAG GGC CGT GTG GGC TGG CGA 
Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin Gly Arg Val Gly Trp Arg 
100 . 110 

TGG ATT CGA ATT GAC ACT GCC TGC GTC TGC ACA CTC CTC AGC CGG ACT 
Trp He Arg He Asp Thr Ala Cys Val Cys Thr Leu Leu Ser Arg Thr 

120 

ggc cgg gcc/tga\; 

Gly Arg Alal OP* J 
130^ ' 
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FIGURE 3 



1 C GAG AGA TGC TCT GAG AGA TGC TCC CAC TCC CCC AGG CTC CCT CCG 
1 Glu Arg Cys Ser Glu Arg Cys Ser His Ser Pro Arg Leu Pro Pro 

4 7 CAT CCC CCT CAT TTT CCT CCT CCC CAG TGT GTC AAT GGA GTC CTA ACC 
16 His Pro Pro His Ph<* Pro Pro Pro Gin Cys Val Asn Gly Val L<*u Thr 

95 CCA TCC TCG ACA TTG TCG CCT TTT CCT CCT CCA GAG TGG GAC CTT CTT 

3 2 Pro £er Ser Thr Leu Ser Pro Phe Pro Pro Pro Glu Trp Asp Leu Leu 

14 3 TTC CCC CGA GTG GTC CTG TCT AGG GGT GCC GCT GCC GGG CCC CCT CTG 

4 8 Ph£ Pro Arg Val Val Leu Ser Arg Gly Ala Ala Ala Gly Pro Pro Leu 

191 GTC TTC CTG CTG GAG ACT GGA GCC TTT CGG GAG TCA GCA GGC GCC CGG 
64 Val t>he Leu Leu Glu Thr Gly Ala Phe Arg Glu ,Ser Ala Gly Ala Arg 

23 9 GCC AAC CGC AGC CAG CGA GGG GTG AGC GAT ACT TCA CCG GCG AGT CAT 
80 Ala Asn Arg Ser Gin 1 Arg Gly Val Ser Asp Thr Ser Pro Ala Ser His 

287 CAG GGT GAG CTG GCC GTG TGC GAT GCA GTC AGT GTC TGg' GTG ACA GAC 

96 Gin Gly Glu Leu Ala Val Cys Asp Ala Val Ser Val Trp Val Thr Asp 

335 CCC TGG ACT GCT GTG GAC TTG GGT GTG CTC GAG GTG GAG GTG TTG GGC 
112 Pro Trp Thr Ala Val Asp Leu Gly Val Leu Glu Val Glu Val Leu Gly 

383 GAG GTG CCT GCA GCT GTC GGC AGT TCC CTC CGC CAG CAC TTC TTT GTT 
128 Glu Val Pro Ala Ala Val Gly Ser Ser Leu Arg Gin His Phe Phe Val 

431 GCC CGC TTC GAG GCC GAT AAA TCT GAG GAA GGT GGC CCG GGG GTA GGT 
144 Ala Arg Phe Glu Ala Asp Lys Ser Glu Glu Gly Gly Pro Gly Val Gly 

479 GGA GGG GCT GCC GCC GGG GTG TGG ACC GGG GGG CAC TGG GTG TCT GAG 
160 Gly Gly Ala Ala Ala Gly Val Trp Thr Gly- Gly His Trp Val Ser Glu 

527 TGC AAG GCC AAG CAG TCC TAT GTG CGG GCA TTG ACC GCT GAT GCC CAG 
176 Cys Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala Asp Ala Gin 

575 GGC CGT GTG GAC TGG CGA TGG ATT CAA ATT GGC ACA GCC TGT GTC TGC 
192 Gly Arg Val Asp Trp Arg Trp lie Gin lie Gly Thr Ala Cys Val Cys 

623 ACA CTC CTC AGC CGG ACT GGC CGG GCC TGA GACTTATA CCCAGGAACT 
208 Thr Leu Leu Ser Arg Thr Gly Arg Ala OP* 

671 GGTCAGGCAG AAAAA 
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1 ACTGGAGCGC AGCACCACGCi CCAGCTAATT TTGGTATTAT CAGTAGAG^T GTTGTTTCAC 

61 AGTGTTGGCC AGGCTGCTCT CAAACTCCTG ACCTCAAGTC AAACACCCGC CTCAGCCTCC 

121 CAAAGTGCTG GGACTACAGG TGTGAGCCAT AGTGCCTGAC CTG TAG TTG T TGAATATTTA . 

181 TTATTAATCT ACAAGTTGGG TGTGATGCAA GTCCTTTATA TGGAGTCCCC CAAACTTCTA 

241 G AGC AAG GGC TTC CCC ATA ATC CTG GCA GGC AGG CCT CCC CTG GGG TTC 
1 Ser Lys Gly Phe Pro lie He Leu Ala Gly Arg Pro Pro Leu Gly Phe 

290 CCA ACT 'TCT GAC ' CCC ACT GAA GTG TTT ATC TTC TTC CCT AAT CCC AGC CTC 
17 Pro Thr Ser Asp Pro Thr" Glu Val Phe He Phe Phe Pro Asn Pro Ser Leu 

341 CTT TTC CCT GTC TCC ATG TGC TCT GAG AG A TGC TCT GAG AGA TGC TCC TGC 
34 Leu Phe Pro Val Ser Met Cys Ser Glu Arg Cys Ser Glu Arg Cys Ser Cys 

392 TCC CCC AGG CTC CCT CCG CAT CCC CCT CAT TTT CCT CCT CCC CAG TGT GTC 
51 Ser Pro Arg Leu Pro Pro His Pro ' Pro His Phe Pro Pro Pro Gin Cys Val 

443 ATT GGA GTC CTA ACC CCA TCC TCG ACA TTG TCG ,CGT TTT CCT CCT CCA GAG 
6 8 He Gly Val Leu Thr Pro Ser Ser Thr Leu Ser Arg Phe Pro Pro Pro Glu 

4 94 TGG GAC CTT CTT TTC CCC CGA GTG GTC CTG TCT AGG GGT GCC GCT GCC GGG 
85 Trp Asp Leu Leu Phe Pro Arg Val Val Leu Ser Arg Gly Ala Ala Ala Gly 

54 5 CCC CCT CTG GTC TTC CTG CTG GAG ACT GGA GCC TTT CGG GAG TCA GCA GGC 
102 Pro Pro Leu Val Phe Leu Leu Glu Thr Gly Ala Phe Arg Glu Ser Ala Gly 

596 GCC CGG GCC AAC CGC AGC CAG CGT GGG GTG AGC GAT ACT TCA CCG GTG AGT 
119 Ala Arg Ala Asn Arg Ser Gin Arg Gly Val Ser Asp Thr Ser Pro Val Ser 

647 CAT CAG GGT GAG CTG GCC GTG TGC GAT GCA GTC ACT GTC TGG GTG ACA GAC 
136 His Gin Gly Glu Leu Ala Val Cys Asp Ala Val Thr Val Trp Val Thr Asp 

698 CCC TGG ACT GCT GTG GAC TTG GGT GTG CTC GAG GTG GAG GTG TTG GGT GAG 
153 Pro Trp Thr Ala Val Asp Leu Gly Val Leu Glu Val Glu Val Leu Gly Glu 

74 9 GTG CCT GCA GCT GGC AGC AGT TCC CTC CGC CAG CAC TTC TTT GTT ACC CGC 
170 Val Pro Ala Ala Gly Ser Ser Ser Leu Arg Gin His Phe Phe Val Thr Arg 

800 TTC GAG GCC GAT AAA TCT AAG GAA GGT GGC CCG GGG GTA GGT GGA GGA CCT 
187 Phe Glu Ala Asp Lys Ser Lys Glu Gly Gly Pro Gly Val Gly Gly Gly Pro 

851 GCC GCC GGG GTG TGG ACC GGG GGG CAC TGG GTG TCT GAG TGC AAG GCC AAG 
204 Ala Ala Gly Val Trp Thr Gly Gly His Trp Val Ser Glu Cys Lys Ala Lys 

902 CAG TCC TAT GGG CGG GCA TTG ACC ACT GAT GCC CAG GGC CGT GTG GAC TGG 
221 Gin Ser Tyr Gly Arg Ala Leu Thr Thr Asp Ala Gin Gly Arg Val Asp Trp 

953 CGA TGG ATT CAA ATT GGC ACT GCC TGT GTC TGC ACA CTC CTC AGC CGG ACT 
238 Arg Trp He Gin He Gly Thr Ala Cys Val Cys Thr Leu Leu Ser Arg Thr 

1004 GGC CGG GCC TGA GACTT ATACCCAGGA ACTGGTCAGG CAGAAAAAGA ACAGAGCTGG 

255 Gly Arg Ala OP* 
1061 ATGCTGAGAG ACCTCAGGGT TGGCCCAGCT GCTCTACGGA CGGACCCCAG TTGGGGAACT 

1121 CATCAAATCA TCGCAAAATC TCAACTGTCT GAATTTGAGC TCAATCTCTG TAGGATGGGT 

1181 GCAACAATGT 
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1 TTTGAACTCC TGACCTCAAG TCAAACACCG CCTCAGCCTC CCAAAGTGCT GGGACCACAG 

61 GTGTGAGCCA TAGTGCCTGA CCTGTAGTTG TTGAATATTT ATTATTAATC TACAAGTTGG 

121 GTGTGATGCA AGTCCCATAC ATGGAGTCCC CCAAACTTCT AGAGCAAGGG CTTCCCCATA 

181 ATCCTGGCAG GCAGGGCTCC CCTGGGGTTC CCAACTTGTG ACCTCACTGA AGTGTTTATC 

241 CTCTTCTCTA ATCCCAGCCT CCTTTTCCCT GTTTCCATGT CCTCTGAGAG ATGCTCCCGC 

301 TCCCCCAGGC TCCCTCTGCA TCCCCCTCAT TTGCTTCCTC CCCAGTGTGT CAATGGAGTC 

361 CTAA CCC CCA CCC TTG ACA TTG TCC CCT TTT CCT CCT CCA GAG TGG 
1 Pro Pro Pro Leu Thr Leu Ser Pro Phe Pro Pro Pro Glu Trp 

407 GAC CTT ATT TTC CCC CAA GTG GTC CTG TCT AGG GGT GCC GCT GCC GGG 
15 Asp Leu lie Phe Pro Gin Val Val Leu Ser Arg Gly Ala Ala Ala Gly 

455 CCC CCT CTG GTC TTC CTG CTG GAG ACT GGG GCC TTT TGG GAG TCA GCA 
31 Pro Pro Leu Val Phe Leu Leu Glu Thr Gly Ala Phe Trp Glu Ser Ala 

503 GGC ACC CGG GCC AAC CGC AGC CAG CGA GGG GTG AGC GAT ACT TCA CCG 
47 Gly Thr Arg Ala Asn Arg Ser Gin Arg Gly Val Ser Asp Thr Ser Pro 

551 GCG AGT CAT CAG GGT GAG CTG GCC GTG TGC GAT GCA GTC AGT GTC TGG 
63 Ala Ser His Gin Gly Glu Leu Ala Val Cys Asp Ala Val Ser Val Trp 

599 GTG ACA GAC CCC CGG ACC GCT GTG GAC TTG GTT GTG CTC GAG GTG GAG 
79 Val Thr Asp Pro Arg Thr Ala Val Asp Leu Val Val Leu Glu Val Glu 

647 GTG TTG GGT GAG GTG CCT GCA GCT GGC AGC AGT TCC CTC CAC CAA CAC 
95 Val Leu Gly Glu Val Pro Ala Ala Gly Ser Ser Ser Leu His Gin His 

695 TTC TTT GTC ACC TGC TTC AAG GCC GAT AAC TCT GAA GAA GGT GGC CCA 
111 Phe Phe Val Thr Cys Phe Lys Ala Asp Asn Ser Glu Glu Gly Gly Pro 

743 GGG GTA GGT GGA GGG GCT GCC GCT GGG GTG TGG ACC GGG GGG CAC TGG 
127 Gly Val Gly Gly Gly Ala Ala Ala Gly Val Trp Thr Gly Gly His Trp 

791 GTG TCT GAG TGC AAG GCC AAG CAG TCC TAT GTG CGG GCA TTG ACC GCT 
14 3 Val Ser Glu Cys Lys Ala Lys Gin Ser Tyr Val Arg Ala Leu Thr Ala 

83 9 GAT GCC CAG GGC CGT GTG GAC TGG CGA TGG ATT CAA ACT GGC ACA GCC 
159 Asp Ala Gin Gly Arg Val Asp Trp Arg Trp He Gin Thr Gly Thr Ala 

887 TGT GTC TGC ACA CTC CTC AGC CGG ACT GGC CGG GCC TGA GACTT 
175 Cys Val Cys Thr Leu Leu Ser Arg Thr Gly Arg Ala OP* 

931 ATACCCAGGA ACTGGTCAGG CAGAAAAAGA ACAGAGCTAG G 
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